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О.1. Explain the basics of, computer vision and i 


ii mage processing (CVIP). 
Ans. Computer vision can be defined as a sci 


Basically, computer vision is a combination of image processing and pattern 
Tecognition. The output of the computer vision process is image understanding. 
Development of this field is done by adapting the ability of human vision in 
taking Information. Computer vision is the discipline of extracting information 
from Images, as opposed to computer graphics. The development of computer 
Vision 1s dependent on the computer technology system, whether about image 
quality improvement or image recognition. There is an overlap with image 
ne on basic techniques, and some authors use both terms 
ae ^d ud The main purpose of computer vision is to create models and 
к ata and information from images, while image processing is about 
E menting computational transformations for images, such as sharpening, 
a * oe others. It also has similar meaning and sometimes overlapping 
E dd uman and computer interaction (HCI). Its coverage focus on more 

. C CSIBD, interface and all aspects of technologies related to interaction 
aan human and computer. HCI is then developed as a separate discipline 
1$ the field of inerdisciplinary science) which discusses the: 
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n human-computer mediated by technology 
nan aspects. Functionally, computer vision and 


ame, with the aim of interpreting spatial data i.e, data 
human vision б А dimension. However, computer vision cannot be 
inde Ure ki human eye. 
expected to replicate just e vision system has limited performance and 
о айну з Pix eye. Even though many scholars have Droposed 
function compared to к techniques to replicate human eye, however, in 
eu limitations of the performance of computer Vision 
many cases, there are үк сын in their technique is the sensitivity of the 
KA sod on ds the algorithm, and the accuracy of the results. It 
ade e ent of performance evaluation of computer vision 
пов оча ыш evaluation involves measuring some of 
э vec es Fa algorithm to achieve accuracy, strength, or 
a arpa vacant and Шык system performance. The computer Vision 
Ri udis of multiple scientific fields is shown in fig. 1.1. 
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Fig. 1.1 Computer Vision 
Q.2. What is digital image Processing? — D 
Ans. Generally, digital image Processing refers to processing of à 2- 
picture by a digital computer, ' data. 
In case of broader context, it implies digital processing of any 2-D sing, 
DIP has a broad spectrum of applications, like radar, sonar, medical pm 
image transmission and Storage for business applications, acoustic 1" 


utomale 
Processing, remote sensing via satellites and other spacecrafts and auto 
inspection of industrial Parts. a 


Й 


Unit-] 5 
There are two principal of image processing — 
(i) Improving image quality., " 
(ii) Machine Perception of visua] information. 
For performing image processing, 
first digitize a picture into an image file, 
Then digital technique is applied to 
rearrange image puts, to improve the 
quality of shading or to improve colour 
separations. An example of enhance the 
quality of a picture is appeared in fig. 
1.2. Similar techniques are employed to 
analysis galaxies images, 


Blurred Image 


MP 07 2947 


Clear Image 


Fig. 1.2 Digital Image Processing 
0.3. Describe history of CVIP. 


Ans. In between 1960-1970, when computer vision first started out in 
the early 1970’s, it was viewed as the visual Perception component of an 
ambitious agenda to mimic human intelligence and to endow robots with 
intelligent behaviour. At that time, it was believed by some of the early pioneers 
of artificial intelligence and robotics that solving the visual input problem would 
be an easy Step along the path of solving more difficult problems such as 
higher level reasoning and planning. According to one well known Story, in 
1966, Marvin Minsky at MIT asked his undergraduate student Gerald Jay 
Sussman to spend the summer linking a camera to a computer and getting the 
computer to describe what it saw. 


In 1980s to 1990s, a lot of attention was focused on more sophisticated 
mathematical techniques for performing quantitative image and scene analysis. 
Image pyramids started being widely used to perform tasks such as image 
blending and coarse to fine correspondence search. 


In 1990s to 2000s, while a lot ofthe previously mentioned topics continued 
to be explored, a few of them became significantly more active. A burst of 
activity in using projective invariants for recognition evolved into a concentrated 
effort to solve the structure from motion problem. 


From 2000s to till today, this past decade has continued to see a deepening 
interplay between the vision and graphics fields. In particular, many of the 
lopics introduced under the rubric of image-based rendering such as image 
Stitching, light-field capture and rendering etc. 
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puter vision is shown in fig. 1.3. 


1970s to 1980s 


э DIP (Digital Image Processing) 
e Blocks world, line labeling 
e Generalized cylinders 

e Pictorial structures 

e Stereo correspondence 

e Intrinsic images 

e Optical flow . 

e Structure from motion 


1980s to 1990s 


e Imagepyramids 
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e Regularization А 
• Physically based modeling 
e Markov random fields 


e Kalman filters 
e 3D range data processing 


1990s to 2000s 


e Projective invariants 


e Graph cuts 
e Particle filtering 

e Energy based segmentation 

e Face recognition and detection 
e Subspace methods 


e Image based modeling and ren- 
dering i 

e Texture synthesis and inpaint- 
ing 

* Computational photography 

e Feature-based recognition 

e MRF inference algorithms 


* Category recognition 
* Learning 


Fig. 1.3 Most Active Topics in CVIP 
` 04. Explain in detail about the operational step of СР. 


Ans. Computer vision works by using algorithm and optica 
stimulate human visualization in order to automatically 


. " prs th 
information from an object. Compared to conventional methods 


15100 ћ 


simulated human visualization. It also combined with li 
facilitate image acquisition continued with image analysis- 
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long time and require complex laboratory analysis, computer VIS" once) К 


expanded into a branch of artificial intelligence (artificial p 
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Image processing involves a series of image operations to enhance the 
quality of a digital image so as to remove defects such as geometric distortion, 
improper focus, repetitive noise, non-uniform lighting and camera motion. Image 
analysis is the process of distinguishing the objects from the background and 
producing quantitative information, which is subsequently used for decision > 
making. Processing and analysis can be performed on many different types of 
image data. These include, in an increasing order of complexity such as, binary 
images, grayscale, color, polarized-light, multi-spectral and hyper-spectral, 3D 
images, multi-sensor and multimedia systems, and image sequences and video. 
The image processing consist three levels of processing, viz. (i) low level 
processing which includes image acquisition and pre-processing of image, 
(ii) intermediate level processing which involves image segmentation, image 
representation and description, and (iii) high level processing which involves 
recognition of ROIs (regions of interests) and interpretation for quality sorting 
and grading. The terms machine vision or computer vision is often used for 
the entire subject, including image processing and analysis and pattern 5 
recognition techniques. Hence, the process of making a decision involves а 
number of steps in sequential order. Not all situations require all of these steps 
or operations, but all are potentially available to deal with particular problems. 


Computer vision generally consists of the following five steps or operations + 
as shown in fig. 1.4. First, image acquisition operations to convert images into 
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Fig. 1.4 The Operational Steps for CVIP 
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Second, pre-proce 
imensions as th 


ssing operations to obtain an improved ir 
е original image. Third, image Segment 
ion a digital image into disjoint and non-overlapping regions 
ion a 0181 t operations to measure the characteristic. of 
кк ce and texture; and fifth, Classification M 
by classifying them into different group 


ma ge 


digital form. ation 


with the same dit 

operations to partit 

Fourth, object meas shape r 
= T SIZe, py? 

objects, s to identify objects 3 

sorting ope 


lain some examples of application using digital image processing. 
0.5. Explain 


Some examples of application using digital image processing are aş 
Ans. Som 


follows — . VM 
ing — X-ray imaging is perhaps the most familiar 
| (i) | ae ae {нё ited sources of EM radiation used for 
type of imaging. s x -rays is medical diagnostics, but they also are used 
imaging. The шы and other areas like astronomy. X-ray for medical 
extensively in in rated using X-ray tube, which is a vacuum tube with a 
imaging x pues The cathode is heated, causing free electrons to be 
vn ma electrons flow at high speed to the positively charged anode, 
S electrons strike a nucleus, energy is released in the form of X-ray 


radiation. | 

(ii) In Health Care — Several medical tools use image processing 
for various purposes, such as image enhancement, image compression, object 
recognition, etc. X-radiation (X-rays), computed tomography scan (CT ns 
positron-emission tomography (PET), single-photon emission computat 
tomography (SPECT), nuclear magnetic resonance (NMR) is ai 
ultra-sonography are some popular pieces of medical equipment based on 
image processing. 
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Fig. 1.5 Body Area Sensor Network 
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(iii) In Agriculture — Image processing, plays a vital role in the field 
of agriculture. Various paramount tasks such as weed detection, food grading, 
harvest control and fruit picking are done automatically with the help of image 
processing. Irrigated land mapping, determination of vegetation indices, canopy 
measurement, etc., are possible with good accuracy through the use of imaging 
techniques in various spectrums, such as hyper spectral imaging, infrared, 
etc. 

(iv) In Weather Forecasting — Image processing also plays a crucial 
role in weather forecasting, such as prediction of rainfall, hailstorms, flooding. 
Meteorological radars are widely used to detect rainfall clouds and, based on 
this information, system predict immediate rainfall intensity. 


(v) In Photography and Film — Retouched and spliced photos are 
extensively used in newspapers and magazines for the purpose of picture 
quality enhancement. In movies, many complex scenes are created with image 
and video editing tools which are based on image and video processing 
operations. Image processing-based methods are used to predict the success 
of upcoming movies. For a global media and entertainment company, latent 
view extracted over 6000 movie posters from IMDB along with their metadata 
(genre, cast, production, ratings, etc.), in order to predict the movies’ success 
using image analytics. The colour schemes and objects in the movie posters 


were analyzed using machine learning (ML) algorithms and image processing 
techniques. 


(vi) In Banking and Finance — The use of image processing-based 
techniques is rapidly increasing in the field of financial services and banking. 
‘Remote deposit capture’ is a banking facility that allows customers to deposit 
checks electronically using mobile devices or scanners. The data from the 
check image is extracted and used in place of a physical check. Face detection 
is also being used in the bank customer authentication process. Some banks 
use ‘facial-biometric’ to protect sensitive information. Signature verification 
and recognition also plays a significant role in authenticating the signature of 
the customers. However, a robust system used to verify handwritten signatures 
is still in need of development. This process has many challenges because 
handwritten signatures are imprecise in nature, as their comers are not always 
sharp, lines are not perfectly straight, and curves are not necessarily smooth. 


(vii) In Forensics - Tampered documents are widely used in criminal 
and civil cases, such as contested wills, financial paper work and professional 
business documentation. Documents like passports and driving licenses are 
frequently tampered with in order to be used illegally as identification proof. 
Forensic departments have to identify the authenticity of such suspicious 
documents. Identifying document forgery becomes increasingly challenging 
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such as forensic investigation, 


jn various areas, 


medical jmaging, ic verification systems provide ahigh ley d 
;ometric verification techniques are Used for 


peoples’ movements and activities. Several banks 
employed in © ing these image processing-based video surveillance 
sired activities. 


0.6. Explain some applications of computer vision. 


ision are as follows — 
ications of computer vision Г}. 
Ans. pen Character Recognition — This is, one of the oldest 
i 

tications of computer : 
cei a te ШЕШ to read zipcodes, or license plates. 
TU (ii) Mobile Visual Search — With computer vision, We can doa 
; using an image as the query. 
search on Google using — Autonomous driving is one of the hottest 


jii) Self-driving Cars mous 
Se sion f i vision. Companies like Tesla, Google or General 


motors compete to be the first to build a fully autonomous zu "s 

Ln (v) Automatic Checkout — Amazon Go is a new kind o | im 
has no checkout. With computer vision, algorithms detect ree 
products you take and they charge you as you walk out of ihe store. " 

<= “u 0) Vision-based Interaction — Microsoft's Kinect us x 
movement in real time and allows players to interact directly with a 8 
through moves. Р ёна | 

| (vi) Augmented Reality — Yt is also a very hot field ri ы 

multiple companies are competing to provide the best mobile AR P 
Apple’s ARKit has some impressive applications. 


ght now, and 


(vii) Scene Recognition — It is possible to recognize th отри 
where a photo was taken. For instance, a photo of a landmark can be € identify 
to billions of photos on google to find the best matches. We can then 
the best match and deduce the location of the photo. 


: rs ine 
(viii) Face Detection — Yt has been used for multiple En al 
п. d 


oe 


to take better pigtures,and focus on the faces, Smile detecti 
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camera to take pictures automatically when the subject is smiling. Face 
recognition is more difficult than face detection, but with the scale of today's 
data, companies like Facebook are able to get very good performance. Finally, 


we can also use computer vision for biometrics, using unique iris pattern 
recognition or fingerprints. 


Q.7. Discuss about the image model in CVIP. 


Ans. In image processing and computer vision, several image models 
have been accepted and are in recurrent use over the last several decades. In 
these image models, the emphasis is put on image quantities. The image support 
is formed by pixels usually considered as sampled locations. With each pixelis 
associated a scalar quantity called a gray level, or a vector quantity called 
either color at the perceptual level or multispectral at the signal level. Existing 
models differ primarily in terms of how the image quantity values are formulated 
and consequently the mathematical language used in the formulation. The well 
known image models are the function, the random process, and the ordered 


set. The image is a function L, xLy — С, where Ly ={l,.. Nx} and 


Ly = (L.., NyJ, Nx x Ny is the resolution of the image, G = {0,1,....n}, where 


n is the maximal quantity and m is the number of image bands. In the case of 
a binary image n = 1, image processing has roots in data structure, graph 
theory, language theory, logic, discrete geometry, and so on. If n > 1, usually 
the image is modeled as a real function (analog image where G, Ly, Ly © R). In 
this case, function theory, functional analysis, differential equations, and 
differential geometry are the foundation. An image can also be represented as 
a collection of random variables (X (i, j) | (i, j) € Ly x Ly}. In this case, the 
probability density function, moments, sufficient statistics, time series, and 
the Markov processes are the roots. When the image is modeled as an ordered 
set, discrete mathematics and mathematical morphology are the foundation. 


Fundamentally, an image is a physical or mathematical quantity where . 


variables (image support) represent geometrical or temporal elements such 
as points, lines, surfaces, and time. Although all image models have deep 
roots in mathematics, the full functionality and role of the image support is 
not apparent and the association between the support and image quantities is 
not well defined. For a given computer vision or image processing task, no 
formal mechanism is given for the integration of physical topological, 
geometrical properties of objects and their precise functionality as part of 
the image model. Consequently, the resulting computational schemes are 
non-modular and sometimes not easy to reproduce. Our goal is to give a 
computational image model in terms ofa data structure in which it is possible 
to retrieve all objects properties at any step of the processitig to’complete a 
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ns or drawback on the efficiency ori 
as an abstract data structure in the 
the formal specification of the image variables, Mh 
on between quantities and variables that sige 
nd topological properties of objects as well as hen 
capture the matical behaviour. This abstract view of the image a, 
physical and mathe ams is defined by its attributes and a collection к 
used in capua жи The attributes аге the image support an d duae 
meaningful P the image support such as the image radiometry non 
that are assigned to ЕА my 


1) or any feature 
color and orm quantities are scalar, vector or tensor. The allowable 
e.g. E of two kinds — the operations that are problem independent 
operations 


h ad and write and those that are problem dependent such as feature 
as re 1 | 
сеа image segmentation, and image enhancement 
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Q.8. Write short note on image filtering. 
e the quality of the image. For 


mage reduces noise, blurred images can be rectified. 
linear and non-linear. Linear filter 


Ans. Image. filtering is done to improv 


example, smoothing an i 


There are broadly two types of algorithms — randn 
is achieved through convolution and Fourier multiplication whereas non-linear 


filter cannot be achieved through any of these. Its output is not the linear 
function of its input thus, its result varies in a non-intuitive manner. 


Filtering of image is an important process done in image processing. : 
can be done for noise removal, blur removal, edge detection etc. Linear an 
non-linear filters are the algorithms which are used for filtering. Right + 
should be selected for any specific purpose. If the image or input sive" r 
less amount of noise but the magnitude is high then non-linear filters are Ee 
whereas linear low-pass filter is sufficient when the input given contains ae 
in large amount but the magnitude of noise is low. Linear filters ar? the t 
frequently used filters as it is simplest and fastest. Unlike non-linear mere 
linear filtering is done through applying the algorithm on the neighbor P. y 
of the input pixels in the image. The neighbourhood pixels are identified tnr 
their locations which are relative to the input pixel. 
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Q.9. Write short note on spatial filters. 


Ans. Spatial filtering term is the filtering operations that are performed 
directly on the pixels of an image. The process consists simply of moving the 
filter mask from point to point in an image. At each point (x, y) the response of 
the filter at that point is calculated using a predefined relationship. 
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Fig. 1.6 Classification of Spatial Filter 


‘Spatial filter can be classified into (i) smoothing spatial filters and (ii) 
sharpening spatial filters. These filters can be either linear or nonlinear. In 
linear filter each pixel value in the output image is a weighted sum of the pixel 
in the neighbourhood of the corresponding pixel in the input image. Nonlinear 
filtering operation is based conditionally on the values of the pixels in the 
neighbourhood, and they do not explicitly use coefficients in the sum-of- 
products manner. Noise reduction can be achieved effectively with a nonlinear 


filter. 


Q. 10. Explain in brief about smoothing spatial filters. 


Ans. Smoothing filters are used for noise reduction and for blurring. 
Blurring.is used in preprocessing tasks like removal of small details from an 
image prior to object extraction, and bridging of small gaps in lines or curves. 
By blurring with linear and nonlinear filters, noise reduction can be 
accomplished. 


() Smoothing Linear Filters — The output of a smoothing, linear 
spatial filter is the average of the pixels contained in the neighbourhood of the 
filter mask. 


а RE e 
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The concept behind smoothing filters is straightforward. By replacing th 
every pixel in an image by the average of the intensity levels in di 
neighbourhood specified by the filter mik Los results of this process in an 
image with reduced "sharp" transitions in intensities. Due to random noise c onsists 
of sharp transitions in intensity levels, the most obvious smoothing application i 
noise reduction. Although, sharp intensity transitions characterize edges, So 
averaging filters have the undesirable side effect that they blur edges, Another 
application of this type of process is that the smoothing of false Contours which 
result from using an insufficient number of intensity levels. A main use of ay 
filters is in the reduction of irrelevant detail in an image. Irrelevant mea; 
s that are small with respect to the size of the filter mask. 


value of 


eraging 
ns pixel 
region 

(ii) Order-statistic (Nonlinear) Fi ilters - These filters are non] inear 
spatial filters whose response is based on ordering the pixels contained in the 
image area encompassed by the filter, and then replacing the value of the center 
pixel with the value determined by the ranking result. In this category, the best. 
known filter is the median filter, which, as its name implies, replaces the value of 
a pixel by the median of the intensity values in the neighbourhood of that pixel, 
Median filters are quite popular because they provide excellent noise-reduction 
capabilities with considerably less blurring as compare to linear smoothing filters 
of similar size for certain types of random noise. Median filters are particularly 
effective in the presence of impulse noise, also called salt and pepper noise 
because of its appearance as white and black dots superimposed on an image. 


0.11. Discuss the various types of smoothing linear filters. 
Ans. The various types of smoothing linear filter are as follows - 


(i) Mean Filter — The filter computes the value of each output 
pixel by finding the statistical mean of the neighbourhood of the corresponding 
input pixel. The following figure illustrates the local effect of the mean filter. 


Fig. 1.7 The Effect of Mean Filter 
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The statistical mean of the neighbourhood on the left is passed as the output 
value associated with the pixel at the center of the neighbourhood. 


(ii) Wiener Filter — The Wiener filter is a classic method for 
attempting to remove noise from images. It was developed (for 1D applications) 
by Norbert Wiener in the 1930’s and 1940’s. The Wiener filtering executes an 
optimal tradeoff between inverse filtering and noise smoothing. It removes the 
additive noise and inverts the blurring simultaneously. The Wiener filtering is 
optimal in terms of the mean square error. The Wiener filter has two separate 
parts, an inverse filtering part and a noise smoothing part. It not only performs 
the deconvolution by inverse filtering (high pass filtering) but also removes 
the noise with a compression operation (low pass filtering). 


age (b) Image Blurred (c) Image after (d) Image after the 
Inverse Filter Wiener Filter 


(a) Original Im 


Fig. 1.8 Wiener Filter Applied to a Noise Image | 
Image statistics vary too much from a region to another even within the 
same image. Thus, both global statistics (mean variance, etc. of the whole 
image) and loca! statistics (mean, variance, etc. of a small region or sub- 
image) are important. Wiener filtering is based on both the global statistics and 
local statistics. 


(iii) Gaussian Filter — A Gaussian filters smoothens an image by 
calculating weighted averages in a filter box. The Gaussian smoothing operator 
performs a weighted average of surrounding pixels based on the Gaussian 
distribution. It is used to remove Gaussian noise and is a realistic model of 


defocused lens. 


Fig. 1.9 Gaussian Smoothing 


D 
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This removes fine image detail and noise leaving only larger scale 
Gaussian Blurs produce a very pure smoothing effect without side e 
Gaussian Blur is distinct from other blurs in that it has a well-defineq 


| + + + 
| | different levels of detail within an image. 


| Q.12. Describe various types of nonlinear smoothing filters, 


Changes 
fects A 
effect On 


Ans. The various types of nonlinear smoothing filters are as follows 


(i) Order-statistic Filter — Order-statistics filters are nonlinear 
filters whose response is based on ordering (ranking) the pixels contai 


the image area encompassed by the filter, and then replacing the value 
el with the value determined by the ranking result. 


Spatial 
ned in 
of the 


center pix 

(a) Min and Max Filter— The minimum filter selects the smallest 

value within the pixel values and maximum filter selects the largest value Within 
of pixel values. This is accomplished by a procedure which first finds the 
minimum and maximum intensity values of all the pixels within a windowed 
region around the pixel. If the intensity of the central pixel lies within the intensity 
range spread of its neighbours, it is passed on to the output image unchanged, 


Table 1.1 The Example and Description of Max, Min and Midpoint 
Filters 


Example Filter 


Image Type Description 


The center pixel would be changed from 77 
to 219 as it is the brightest pixel within the 
current window. 


The center pixe! would be changed from 77 


to 0 as it is the darkest pixel within the 
| current window. 
The center pixel would be changed from 77 to 
109 as it is the midpoint between the brightest 
pixel 219 and the darkest pixel 0 within the 
current window. 


However, if the central pixel intensity is greater than the maximum value, 
It is set equal to the maximum value; if the central pixel intensity is less than 
the minimum value, it is set equal to the minimum value. 


(b) Median Filter — Median filter is the nonlinear filter 97 
used to remove the impulsive noise from an image. With the median filter, al 
the pixels in the neighbourhood are ranked by intensity level and the center 
pixel is replaced by that pixel which is mid-way in ranking. 
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(a) Noise Image (b) Most of the Noise is Removed 
by using Median Filter 
Fig. 1.10 


(c) Midpoint Filter — The inidpoint filter blurs the image by 
replacing each pixel with the average of the highest pixel and the lowest pixel 
(with respect to intensity) within the specified window size. 

Midpoint — (Darkest + Lightest)/2 


(ii) Rank-order Filters — Rank-order filters are spatial-domain 
nonlinear filters, which are based on the correction of the local histogram within 
the filtering window. Rank-order filters are adaptive to the signal local statistics. 
Image processing with rank-order filters is reduced to the creation of the filtering 
interval from the limited number of pixels belonging to the filtering window with 
the further correction of the central pixel within the window using some kind of 
averaging of the selected pixels. There are three rank-order filters — Rank-order 
EV filter, Rank-order ER filter and Rank-order KNV filter. 


(a) Rank-order EV Filter — A filtering interval for this filter is 
composed of all brightness values belonging to the pixels within the filtering 
window whose absolute difference from the central pixel brightness value is 
less than or equal to EV (which is a main control parameter for this filter). The 
most important property of this filter is that it smoothens the brightness jumps 
which are less than or equal to EV, and preserves the brightness jumps that are 
greater than EV. The EV filter is highly effective for reduction of white noise. 


(b) Rank-order ER Filter — A filtering interval for this filter is 
composed of all brightness values belonging to the pixels within the filtering 


` window whose rank difference from the central pixel brightness value rank in 


the variational series is less than or equal to ER, which is main control parameter 
for this filter. This filter is effective for the reduction of complicated noise 
types with unknown statistics, and for the reduction of any complicated noise 
containing an impulsive component. 
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(c) Rank-order KNV Filter — A filtering interval for this fil 

fthe number of brightness values (belonging to the pixel wn 
the filtering window) which is equal to KNV and whose values are Closest i 
the central pixel brightness value (KNV is a main control parameter for 
portant property of this filter is that it smoothens only de 


filter). The most im 
objects whose area is less than the number of square pixels that are equal t 
KNV, and preserve the objects whose area is greater than KNV. ° 


0.13. Discuss about median filtering. (R.GRV, June 29 17) 


Ans. The best known filter in order statistical nonlinear filters are know 
as median filters. Turkey introduces the concept of a median filter in 1997 
which utilizing the median of the neighbourhood, to smoothen the image. Pratt 
extended this concept to two dimensional images. To determine each pixel 
value in the processed image following tasks are performed by the Median 


is composed о 


filters — 
(i) Inthe original image, all pixels in the neighbourhood of the pixel 


which are identified by the mask are sorted in descending or ascending order 
(ii) For the processed image, sorted median value is calculated and 
selected as the pixel value. 

Median filter replaces the pixel value by the intensity median values in the 
neighbourhood of that pixel. For certain types of random noise, median filters 
have very good noise-reduction capabilities with considerably, minimum blurring 
as compared to similar size linear smoothing filters. In the presence of impulse 
noise, median filters are specially effective they are also known as salt-and- 
pepper noise because of its appearance as black and white dots superimposed 
on an image. The median Ё of a set of values is such that half the values in the 
set are equal or less than to & and half are equal or greater than to &. For 
performing median filtering at a point in an image, first sort the pixel values in 
the neighbourhood than find their median, and finally, assign that value to the 
corresponding pixel in the filtered image. For instance, the median is the 5s 

largest value in a3 x 3 neighbourhood and it is the 13" largest value in a5 х 
5 neighbourhood. All equal values are grouped, when several values in а 
neighbourhood are the same. For instance, assume that a 5 x 5 neighbourhood 
has values (10, 20, 20, 20, 15, 20, 20, 25, 100, 12, 13, 14, 16, 17, 18, 19,2 
22, 23, 24, 26, 27, 28, 29, 30). These values are sorted as (10, 12, 13, 14, 15, 
16, 17, 18, 19, 20, 20, 20, 20, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 100) 
which results in a median of 20. Hence, the median filters are used to force 
points with distinct intensity levels to be their neighbours. An m х m median 
filter eliminates isolated clusters of pixels which are light or dark with therr 
neigh bour s, and whose area is less than m?/2 (one half the filter area), here the 
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word eliminated implies that forced to the median intensity of the neighbours. 
Larger clusters will be affected considerably less. 


Fig. 1.11 


Q.14. Write some properties of median filter. 
Ans. There are several properties of median filter as follows — 
(i) When the number of noise pixels in the window is larger than or 
half the number of pixels in the window, median filter performance is not good. 
(ii) Median filter is a non linear filter. Hence for two functions x(k) 
and y(k) the relation is given below — 
median {x(k) + y(k)} # median {x(k)} + median (y(k)) 
(iii) While preserving spatial resolutions, median filter can be used 
for eliminating isolated lines or pixels. 


Q.15. Explain in brief about sharpening spatial filters. 


Ans. The sharpening is used to highlight transitions in intensity. Uses of 
image sharpening vary and include applications ranging from electronic printing 
and medical imaging to industrial inspection and autonomous guidance in military 
systems. Image blurring could be accomplished in the spatial domain by pixel 
averaging in a neighbourhood. It is logical to conclude that sharpening can be 
accomplished by spatial differentiation due to averaging is analogous to 
integration. This, in fact, is the condition, and deals with various ways of 
defining and implementing operators for sharpening by digital differentiation. 
Basically, the strength of the response of a derivative operator is proportional 
to the degree of intensity discontinuity of the image at the point at which the 
operator is applied. Hence, image differentiation enhances edges and other 
discontinuities and deemphasizés areas with slowly varying intensities. ~~ 
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We consider in some detail sharpening filters which are based on n 
rder derivatives, respectively. We focus attention on one dime 
derivatives. In particular, we are interested in these derivatives beha 
areas of constant intensity, at the onset and end of discontinuities, 
intensity ramps. Such discontinuities сап be employed to model поі 
lines, and edges in an image. The behaviour of derivatives during t 
into and out of these image features also is of interest. 


In terms of differences, the derivatives ofa digital function are Specified 
There are multiple methods to define these differences. Although, it is requir 
that any definition uses for a first derivative (i) must be zero in areas of constar 
intensity (ii) at the onset of an intensity step or ramp, must be nonzero (i 
must be nonzero along ramps. Similarly, any definition of a second derivativ 
(i) in constant area, must be zero (ii) at the onset and end of an intensity ste 
or ramp, must be nonzero (iii) must be zero along ramps of constant Slope 
Since we are dealing with digital quantities whose values are finite, the maximun 
possible intensity change also is finite, and the shortest distance over whic 
that change can occur is between adjacent pixels. 


rst an, 
NSiona 
Viour į 
nd alon, 
€ point; 
Tansition 


second o 


A fundamental definition of the first-order derivative of a one dimensiona 
function f(x) is the difference given below — 


of 
a TART) a 
Here, a partial derivative is used in order to keep the notation same as if - 
we consider an image function of two variables, f(x, y), at the time of dealing 
with partial derivatives along the two spatial axes. Use of partial derivative 
does not affect in any way the nature of what we are trying to accomplish. 


Clearly, of/Ox = df/dx if there is only one variable in the function and the same 
is true for the second derivative. 


. The second-order derivative of f(x) can be defined as the difference, 
which is given below — ' / 


ad Re + f(x – 1) - 2f(x) - 
It is easily verified that these two definitions satisfy the cases stated above- 
Q.16. Discuss about the image representation. 


Ans. Image representation is the way in which images are described and 

stored in the computer. The efficiency of image processing algorithms К 
always be determined by the selection of different image representation methot! 
to a great extent. Image representation is of primary importance for obit 
“recognition and image understanding. A good representation schema shoul 
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be honest, general, brief and helpful for advanced tasks. As a fundamental 
data structure, a representation should capture the distribution of image features 
honestly and quickly, and make them accessible to higher processing layers. 


Q.17. Discuss the method of representing digital images. 


Ans. Consider a continuous image function of two continuous variables, u 
and v is f(u, v). This function is converted into a digital image by using sampling 
and quantization. Assume that the image f(x, y) is a continuous in 2D array which 
has M rows and N columns, where (x, y) are discrete coordinates. We shall use 
integer value for these discrete coordinates, for national clarity and convenience. 
Thus, the value of the digital image at the origin is f(x, y) = f(0, 0). The next 
coordinate value along the first row of the image represented as f(x, y) = f(0, 1). 
Here, the notation (0, 1) is employed to signify the second sample along the first 
row. Normally, the value of the image is indicated f(x, у), at any coordinates (х,у) 
in which x and y аге 
integers. The real plane part f(x, y) 
spanned by the coordinates 
of an images is known as 
spatial (time) domain, with 
m and n being referred to as 
spatial variables (spatial 
coordinates). 

There are three basic 
ways to represent f(x, y) 
as shown in fig. 1.12. A 
plot of the function is 
shown in fig. 1.12 (a) with 


two axes finding spatial (a) 
Origin Origin 
Dopo 
000000 000000 
00000 00000 
0000 : 0000 
000 +5.5.5° 000 
000 55, 000 
5 : 
000 гта 000 
000 11, 000 
0000 1^ 0000 
00000 : 00000 
000000 000000 
00000020--0000000 
(b) (c) 
‘, Fig. 1.12 
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indicates f(x, hs 
sone rand 
point uri atm еб values, as shown in fig. 1.12 (b). Each point in 
three Ta de value 0,0.5, or 1 when the intensity (gray level) is generalizeq 
the image , 0.2, А 
to the interval [0, 1]. As shown in fig. 1. 
converts these three values t 
representation is easily to sho 
this example, the size of fis 60 
printing the complete array shoul 
However, if developing algorithn 
i i in 
only sections of the image are pr 
1.12 (c) represents this concept graphically. | | 
For processing and algorithm development, numerical arrays are used, 
In equation form, the representation of an M x N numerical array can be 


written as 
f(0, 0) f(00) ~ f(0,N-1) 
f(1, 0) f(üL) ~ f, N-1) . 
f(x, y) = : : (i) 
f(M-1,0) f(M-L1) ~ f(M-LN-1) 
The right side is a array or matrix of real numbers. Every element of this 


element, picture element, pixel or pel. The words 


array is known as image ment, 
d to represent a digital image and its elements - 


image and pixel are employe 
40,0 ао ^" 20, N-A 

> a a T a 1, N-1 ii 

A-| 210 L1 | wil 


B H И 


[амо амі '" 4M-1,N- 


Obviously, а; = f(x = i, y = j) = fú, j), so equations (i) and (ii) ar 
matrices. 
0.18. Explain elements of digital image processing system. 
Or | 
generi 
of a ge" 


purpose image processing system and explain each component in de P 
(R.GP. V, Nov 


e similat 


Ans. The basic components consisting a typical gen 
used for digita! image processing is shown in fig. 1.13. 
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There are following components of an image processing system as given 
below — 
(i) Image sensors (ii) Specialized image processing hardware 
(iii) Computer (iv) Image processing software 
(v) Mass storage (vi) Image displays 
{vil) Hardcopy (viii) Networking 


(i) Image Sensors — Image sensing are required two elements to acquire 
digital image. The first is a physical device which is sensitive to the energy radiated 
by the object to image. The second is known as digitizer device which is converted 
the output of the physical sensing device into digital form. The sensors create an 
electrical output proportional to light intensity in a digital video camera. These 
outputs are converting into digital data with the help of digitizer. 


Network 


Image Displays 


Hardcopy . 


Specialized Image 
Processing Hardware 


Image Processing 
Software 


Image Sensors 


Problem Domain 
Fig. 1.13 Components of DIP System 


(ii) Specialized Image Processing Hardware — |t generally made 
up of the digitizer and hardware. The term hardware which is performed 
other primitive operations, like an arithmetic logic unit (ALU). ALU performs 
arithmetic and logical operations in parallel on whole images. Sometimes, this 
kind of hardware is known as front end subsystem and its most differentiating 
characteristic is speed. 


(iii) Computer — The computers are used in digital image processing 
system. This term is also known as general-purpose computer and can range 
from personal computer to a super computer. Sometimes designed computers 
are employed to obtain a required performance level in dedicated applications, 
but our interest here is on general purpose digital image processing systems. 
In this systems, almost any well-equipped personal computer-type machine 
for off-line image processing operations is appropriate. 


(iv) Image Processing Software – It made up of specialized modules 


e sy" which perform specific operations. For the user, a well-designed package also 
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bility to write code that, a aminimu һе Speciali Unit-1 25 
least one com 


ar : . ЕР ; i "n 
modules. General-purpose softw fiware packages permit the integrati n (viii) Networking — This is almost a default function in any computer 


language and more sophisticated so Not system in use today. The consideration is bandwidth in image transmission due 
y he larger amount of data inherent in i ssi icati 

those modules. » . = to tt g image processing applications. For 

(у Mass Storage Capability — t is а must p D d орлоно 4 dedicated networks, this typically is not a problem, but communications with 

where the gray (intensi y) level of each pi remote sites through the Internet are not always as efficient. Luckily, this problem 


1024 x 1024 pixels, - 106 LAT 
ge (1 MB= 10? byte) of Storage space ү is enhancing quickly as a result of optical fiber and other broadband techniques. 


ize of ima 
A needs one megabyte 


is an 8-bit quantity, di ient storage in an image pr н 
the image is uncompressed. Pe E. аад of ння vere 0.19. What are the fundamental steps in image processing ? 
Я ; 
oemi i im pite storage falls into three basic principal * Ans. There are several steps in image processing as follows — 
ing applicatio : : : TM SE 
processing app , short term storage is used. Compute, (i) Image Acquisition — Acquisition could be as easy as being 


(a) During processing, Р 

Р і iving short term sto 

ory is a technique of giving 5 i : 

а boards as known as тате buffers, which store one or more images 

M may be accessed rapidly, usually at video rates 30 frames/second. Frame (ii) Image Enhancement — it is the process of manipulating an 

iuis normally are housed in the specialized image processing hardware шщ image so that the result is more appropriate as compared to the original for a 

ou 3 particular application. The word particular is important here, because it 
illustrated in fig. 1.13. : : 

On line storage for relatively fast recall, on line storage establishes at the outset that enhancement methods are problem oriented. Hence, 

T pe on of magnetic disks or optical-me dia storage. The main for example, a procedure which is quite useful for enhancing X-ray images 

normally takes ne Е Е is frequent access to the stored data cannot be the best method for enhancing satellite images taken in the infrared 

factor characterizing on-line storage queas í ' band of the electromagnetic spectrum. 
(c) Archival storage, characterized by infrequent access, » 


i i i Image Restoration — Itis an area which also deals with improving 
Archival storage ischaracterized by massive storage equirements but infrequent (iii) 5 
ч р : Ёз ‘i r Р r NETS , theappearance of an image. Although, unlike enhancement, which is subjective, 
require for access. Magnetic tapes and optical disks housed in “juke boxes 


rage. Another is through provided an image which is already in digital form. In general, the image 
acquisition stage includes preprocessing like scaling. 


E н "adn image restoration is objective, in the sense that restoration methods tend to be 
are the usual media for archival applications. based on mathematical or probabilistic models of image degradation. 
e output is the final stage of the imag Enhancement, on the other hand, is based on human subjective preferences 


(vi) Image Displays — Imag : 
monitors are used to display the regarding what constitutes a “good” enhancement result. 


processing system. Colour television (TV) d to 
image. Monitors are driven by the image outputs and graphics display carts (iv) Color Image Processing — lt is an area that has been gaining in 


which are an integral element of the computer system. For image disp importance because of the significant increase in the use of digital images over 
applications, seldom are there needed which cannot be met by display c5 the Internet. 

* А Р cases, Е Е T ә 
available commercially as element of the computer system. For abel | (v) Wavelets — These are foundation for showing images in various 
these are implemented in the form headgear having two small displays e degrees of resolution. Particularly, this is used for image data compression 
in goggles worn by the user and it is required to have stereo displays. and for pyramidal representation, in which images are subdivided successively 


(vii) Hardcopy — t is used for recording images such as last! into smaller regions. 


printers, heat-sensitive devices, film cameras, inkjet units and digital n (vi) Compression — lt deals with methods for reducing the storage 


include optical and CD-ROM (compact disk-read only memory) disks. 10 needed to save an image, ог the bandwidth needed to transmit it. However, 


Н А Spi 
highest possible resolution is provided by the film, but paper 1S the is storage technology has improved significantly over the past decade, 
ofthe 


medium of selection for written material. When equipment of image P" 9 "m 
is employed, images are shown on film transparencies or in a digital me " 
for presentations. For presentation of image, the latter procedur 
acceptance as the standard. 


| . B . 

, Internet, that are characterized by significant pictorial content. Image 
compression is familiar to most users of computers in the form of image file 
extensions like jpg file extension used in the JPEG image compression standard. 


e is gai" 
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(vii) Morp hich are useful in the representation and q БТ 
image components w ese Outputs of These Processes are Images 
i 
shape. ; It partition an image into it 
"m entation — р i Б Ils Constit, Morphological " Wavelets and 
Le ty, autonomous segmentation 1s one of the moa wg Processing Compression LM poderes 
Ж, T sually, І ў NE. 
3 MJ or objects. ital image processing. A rugged segmentation methog АШ { 5 
e tasks in digi way toward successful solution of imaging Problen ШҮ ee 
_ \ $28 "E IL 
X T s process > an be identified individually. S Wii | E < | Segmentation | Image Restoration 
аы ; ec Šo 
b- require 0 Је 3: тле чоя - 
ix) Representation and Description — It always follow the НЕ Representation Image Filtering 
(ix, i hich usually is raw pixel data Соу Bg & Description Knowledge Base & Enláncemenr 
fa segmentation stage, WMC = „Са1а, constituting y Ё * 
9 z undary of a region or all the points in the region itself In either i ? Object Image 
b ше g the data to a form suitable for computer processing is Пес : Recognition Acquisition 
| convertin: А b is wh san 
si t be made is whether the data s : REET 
| The first decision which must hould be shoy Domain 


| as a boundary or as а complete region. Boundary representation is appro 
if the focus is on external shape characteristics, like corners and infe Ss | | 
Regional representation is appropriate if the focus is on internal properti | 0.20. Classify image representation methods based оп level of processing. 
texture or skeletal shape. These representations complement each other j Ans. Based on the level of Processing of images bya machine for different 
some applications. Selecting a representation is only part of the solution fy purposes, the image representation methods are grouped into four categories, 
transforming raw data into a form suitable for subsequent computer processin. VIZ pixel based, block based, region based and hierarchical based. 

| For describing the data, a procedure must also be specified so that features, 

| interest are high lighted. Description, also known as feature selection, Ides 

with extracting attributes which result in some quantitative information 


\ Fig. 1.14 Representation of Fundamental Steps 


Image Representation Based on 
Level of Processing 


) interest or are basic for differentiating one class of objects from another. 

| (x) Recognition — It is. the process which assigns а label b: (Pine Base) CD СУ) Hierarchien 

d object based on its descriptors. We/conclude our coverage of d я 

F processing with the development of procedures for recognition of p Fig. 1.15 Classification Based on Level of Processing 

Ha j | i ‘ing about the requi rior knowledg "P . | КИНИ . 

ү E ee z be e said nothing ed bi dide pert moi (i) Pixel Based Representation — This representation is the simplest 
[Р "pain interaction between the knowledge. base ап р bouta prob «representation to define an image. In digital imaging, a pixel, pel, or picture 
| in fig. 1.14. In the form of a knowledge database, knowledge а , element is a physical point in a raster image, or the smallest addressable element 


domain is coded into an image processing system. This knowledge ons iin an all points addressable display device. The representation includes simple 
i ezsy as detailing regions of an. image where the information of Pn jneighbourhood relations between elements. Each pixel contains only local 

i known to be located, hence limiting the search that has to be vr sinformation for each element. The number of elements in the representation is 
seeking that information, The knowledge’ base can be quite complex, normally big and is used for displaying the image and it has applications in 


interrelated list of all major possible defects'in'a materials inspect! Pr medical imaging where each pixel has got its own importance. 
Or an Image database containi | s ofa reg, 


li (ii) Block-based Representation — In this method, the image is 


i у = + 48 Em $ H roc ^ s А H n "i 
the interaction between modules to guiding the operation of үк ва’ smaller than with pixel-based, still only local information is stored which is 
modules. The difference ist ade in fig. 1,14 by the use of me opp” Same that of pixel based representations. Block based representations can be 
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(vii) Morphological Processing — |t deals with tools for extract 


` : inp 
hich are useful in the representation and description |" Outputs af These Processes are Images 


01 


image components w 


shape. Morphological Wavelets and 
Po OE Compression Multi-resolution Color Image 


see ion — It partiti n image into its constitue Process \ 
(viii) Segmentation — lt partition ar ag : constituent par, £ Processing Processing Processing 
or objects. Usually, autonomous segmentation is one of the most diff; п 28 
` Р : T a Н " z Е] 
tasks in digital image processing. A rugged segmentation method brings the E Ё 
n i і i . 9 ч 
process a long way toward successfull solution of imaging problems which $ < Segmentation Image Restoration 
require objects to be identified individually. z А 
Representation Umm 
. ut ge а= P Image Filter 
(ix) Representation and Description = It always follow the outpu Be & Description Knowledge Base & Enhancement 
of a segmentation stage, which usually is raw pixel data, constituting eithe; 2 LT 
the boundary of a region or all the points in the region itself. In either case, Recognition E 
converting the data to a form suitable for computer processing is necessary, 
se 1 i Problem 
The first decision which must be made is whether the data should be Shown Domain 


as a boundary or as a complete region. Boundary representation is appropriate 
if the focus is on external shape characteristics, like corners and inflections, 
Regional representation is appropriate if the focus is on internal properties, like 
texture or skeletal shape. These representations complement each other in 
some applications. Selecting a representation is only part of the solution for 
transforming raw data into a form suitable for subsequent computer processing, 
For describing the data, a procedure must also be specified so that features of 
interest are high lighted. Description, also known as feature selection. It deals 
with extracting attributes which result in some quantitative information of 
interest or are basic for differentiating one class of objects from another. 


Fig. 1.14 Representation of Fundamental Steps 


Q.20. Classify image representation methods based on level of processing. 


Ans. Based on the level of processing of images by a machine for different 
purposes, the image representation methods are grouped into four categories, 
viz. pixel based, block based, region based and hierarchical based. 


Image Representation Based on 
Level of Processing 


Pixel Based Block Based Region Based 


Fig. 1.15 Classification Based on Level of Processing 


Hierarchical 
Based _ 


(x) Recognition — lt is the process which assigns a label to an 
object based on its descriptors. We conclude our coverage of digital image 
processing with the development of procedures for recognition of individual 
objects. So far we have said nothing about the require for prior knowledge or 
about the interaction between the knowledge base and the processing modules (i) Pixel Based Representation — This representation is the simplest 
in fig. 1.14. In the form of a knowledge database, knowledge about a problem representation to define an image. In digital imaging, a pixel, pel, or picture 
domain is coded into an image processing system. This knowledge can be as element is a physical point in a raster image, or the smallest addressable element 
easy as detailing regions of an image where the information of interest is in anall points addressable display device. The representation includes simple 
known to be located, hence limiting the search that Һаѕ їо be conducted in Neighbourhood relations between elements. Each pixel contains only local 
seeking that information. The knowledgebase can be quite complex, like an information for each element. The number of elements in the representation is 
interrelated list of all major possible defects іп а materials inspection problem normally big and is used for displaying the image and it has applications in 
oran image database containing high-resolution satellite images of a region in medical imaging where each pixel has got its own importance. 
connection with change detection applications. The knowledge base controls (ii) Block-based Representation — In this method, the image is 
the interaction between modules to guíding the operation of each processing divided in a set of (rectangular) array size. The number of elements is slightly 


modules. The difference is made in fig. 1.14 by the use of double headed smaller than with pixel-based, still only local information is stored which is 


wd d.t 


arrows between the processing modules and the knowledge base, as oppose! same that of pixel based representations. Block based representations can be 


th 


to single headed arrows linking the processing modules, 


Tertio -- 
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both gray-scale and binary iE age features, te! TheQME pyramid decompo- 
done for bo segmentation, extracting differ : Б кекс. sition splits the image frequency 
compression» юп Based Representation = lit is also known as Super-biy, space into three different scales 
(it ) Regio the regions are not rectangular and itis formed by Broupi, and within each scale, into three 
representation. Here d pixels. The adjacency information between regions i, orientation subbands are vertical 
similar and connecte PAG (region-adjacency graph) or combinatorial map (V), horizontal (H) and diagonal 
represented usually ie sed for object detection and segmentation, but differen (D). Visually each subband 
The representation 15 uin have to be considered. captures the local orientation 
unions of multiple d pom resentation- The representation uses most likely ©88У in an image. The resulting 
(iv) Hierarchic ae d representations. The image representatoy vertical, horizontal and diagonal 
unions of regions of aie Examples includes min-/max-tree, ctr се, quad subbands at scale i are denoted 
can be done at ag ы includes object detection, video segmenta, РУ Vile Y} Hi 06 У) and Di(x, 
tree, bin tree, etc. рр filtering, шаре simplification, etc. , y respec- tively. The first scale (b) The Magnitude of a Three-scale 
image segmentation and filtering, ВГ subbands are the result of con- OMF P d Decomposition of Disc 
0.21. Explain in detail about the image statistics. volving the image with a pair of у a i 
| : і decompositions, namely, ће quadrature mirror 1-0 2n + 1 tap finite impulse 8 
Ans. Two multiscale image P ; response (FIR) low-pass and Fig. 1.16 


position and the local angular harmonic 
mage statistics are collected from image 
ge using basis functions that are localized 


high-pass QMF filters denoted as /(.) and h(.) respectively. The vertical 
subband is generated by convolving the image, I(x, y), with the low-pass 
filter in the vertical direction and the high-pass filter in the horizontal direction 


filter (QMF) pyramid decom c 
decomposition (LAHD). These i 
representations that decompose an ima: 


in both spatial positions and scales, implemented as a multi-scale image antes 
decomposition. aia ч " 
i i id Decomposition — This is a firs _ _ 
Quadrature Mirror Filter Pyrami Vii: y) Y h(m) Yo mun 
multiscale image decomposition, based on separable quadrature mirror filters ae EN 


(QMF). One important reason for choosing ых on es ii Ur The horizontal subband is generated by convolving the image with the 
aliasing from the reconstructed image, making it suitable for the purp low-pass filter in the horizontal direction and the high-pass filter in the vertical 


image analysis. In fig. 1.16 ву lirection as — 

(a), an idealized frequency N N C 

domain decomposition with Я Н, (х, у) = У) Қт). > h(n) I(x-m,y-n) 
a three scale QMF pyramid x m--N  n--N 


The diagonal subband is obtained by convolving the image with the high- 
ass filter in both directions as — 


decomposition. From top to 
bottom, are scales 0, 1 and 7 
2, and from left to right, are 
the low pass, vertical hori- 
zontal and diagonal sub- 


; N N 
^ Di, у= Уһ) P hG)l(x-m.y-n) 


m=-N n=-N 
bands. And in fig. 1.16 (b), : А : : : Е 
а th the low-pass filter in both directions 
ihe imde ofa le Finally, convolving the image with the low-p 


D ‘ idue low- b = 
OMF pyramid decompts ' ' ta i enerates the residue low. Е sub өң as 
sition of a “disc” image. For 
= [ 1 =пьу 
the purpose of display, each а >н) Oe we 


subband is normalized into А ; 
range 0 to 255. (a) Three Scale QMF Pyramid Deco" 


positi 
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n Е ` 0.22, Deseri sep i 

the residual low-pass subband, The decomposition of a RGB color image is 2 escribe the fundamental step 

i h color channel indepen ү 

performed by decomposing eac : | p i represented by a pine) von ca 
bands are denoted as Vf (x,y), Hf (x, y), and О (x,y), with сє{г,р,Ь}. 


recognition system is the light intensities of each pixel 
Color images using other color Systems (e.g., HSV or CMY. K) are decomposed pixel ; : 
by first transforming to RGB colors. 


decomposition is the local angular harmonic decomposition (ГАНЫ). Е ormally, 
the n*-order local angular harmonic decomposition of an image, I(x, y), is 
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The next scale is obtained by first down-sampling the 
subband Ly and recursively filtering with А.) and h(.), as 


N 
Ух, у) = Sheen) 1001 ((к/2 ]- а улә Jn) 


residual lOW-Dasy 
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ms-N ns-N 
N N 
Ha у= Эш) YihG)Li([x/2 |- m|y;2 |) Memmi 
и m=-N  n--N б ве ey 
N N ЖҮ 
D3(x, у) = h(m) Уһ) ([x/2 |- т. |у/2 |- n) 
m=-N n=-N 
N N 
Lax, у) = dm) 2G) Li ([x/2|-m|y/2]-) Fig. 1.18 The first 4-order LAND of a Na 
m--N  n--N 


tural Image, The Top Row 


Shows the Maguitudes and the Bottom Row Shows the Phase Angles 


Subsequent scales are generated similarly by recursively decomposing 


in image recognition, 
dently, These sub- | Ans, Image recognition is usuall 


Local Angular Harmonic Decomposition — The another image 
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defined as — 


As(DG,y) = | Flay GOR() cdr а 


where I(x yy (1,0) is the polar parameterization of image I(x, y) about point 


Digital 
Image 
Data 
(x, y) in the image plane, and R(r) is an integrable radial function. The LAHD Saue 
can be regarded as a local decomposition of image structure by projecting 
onto a set of angular Fourier basis kernels, ein? The function R(r) serves as 
the local windowing function as in the Gabor filte ; Which localizes the analysis ‚ Ж 
in both the spatial and frequency domains. The output of the n-th LAHD, pre 
A,(I) (x, y), is a complex-valued 2-D signal. The magnitudes and phases of Fee Extracted image ) | Sttucture 
the first 4-order LAHD of an image is shown ` E» - | 
in fig. 1.18. Both the magnitudes and the phases 


capture image structures Such as edges, 
comers and boundaries. Note that the basis in ` 
LAHD is highly Over-complete and it is usually 


not possible to reconstruct the image from the 
decomposition, 


Matching |" Intermediate 
К 74 ' Images 


Symbolic Representation 


Fig. 1.19 Image Recognition 
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informs us of the spatial relationship between objects in à scene. The Varios 
steps required to transform iconic information into recognition information m 
shown in fig. 1.19. 

(Ù Image Formatting — This formatting means capturing an image 

by bringing it into a digital form. 

(ii) Conditioning — In an image, there are usually features Which are 
uninteresting, either because they were introduced into the image during the 
digitization process as noise, or because they form part ofa background, Ап 
observed image is composed of informative patterns modified by uninterestin g 
random variations. Conditioning suppresses, or normalizes, the uninteresting 
variations in the image, effectively highlighting the interesting parts of the 

image. 


(iii) Labeling — In this step, informative patterns in an image haye 
structure. Patterns are usually composed of adjacent pixels which share some 
Property such that it can be inferred that they are part of the same structure 
(e.g., an edge). Edge detection techniques focus on identifying Continuous 
adjacent pixels which differ greatly in intensity or colour, because these are 
likely to mark boundaries, between objects, or an object and the background, 
and hence form an edge. After the edge detection process is complete, many 
edge will have been identified. However, not all of the edges are significant, 
Thresholding filters out insignificant edges. The remaining edges are labeled, 


More complex labeling operations may involve identifying and labeling shape 
primitive and corner finding. 


or an arc. If it is a region, then the number of holes mi 
case of an arc, the average curvature of the arc might be useful to know. 
Feature extraction can also describe the topographical relationships between 


different groups. Do they touch ? Does one occlude another ? Where are. they 
in relation to each other ? еіс, 


(vi) Matching — In this Step, 
grouped into objects and the relationshi 
been determined, the final step isto reco 


Q.23. Discuss briefly about mathematical morphology. 
(R.GPV, June 201 7) 


image, the black or White pixels like a set refer to the ima 
description. x and y coordinates are the elements ofa 


0.24. Explain about basic set theory, 


Ans. Morphology is depend on set theory. Set theory includes various 


Operations such as union, intersection, complement, difference, reflection, 
translation are given below — 


(Ù) Union - P. ) Q represents union of images P and Q. P U Q 
represents the set whose elements can be elements of P and Q or either element 
of P or element of Q. The expression is written as 


P U Q= def(x|xeP or хє0} 
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(^ О represents intersection of images P ang 0. 


ii) Intersection — Р t 
o se elements are common for both image P an d 


P ^^ О represents the set who 
image Q. The expression is written as — 


P ^ Q= def(x | xeP and xeQj . 


(iii) Complement — Р© represents complement of image P. pc 


represents the set which involving everything not in image P. The expressio; 


is written as — 
pc = def (x | xePj 
(iv) Difference — P — Q represents difference between image P and 
image Q. P-+-Q represents the-set which: involves subtracts all elements of 
image P, which аге in image Q. The expression is written as — . 


P- 0 = def(x | xeP and xeQj 


* (v) Reflection- Q represents reflection of image Q. The expression 


TU 0 Q= еа аео 
(vi) Translation - (Р), represents translation of image P. The 
expression is — a ber | Si 

(P,- (dco p*zfopeP) > 


с, 


Translation 


d YAT 


, . Fig. 1.20 Image P Translation 
Q.25. Write a short note oh binary morphology 00000 s 
` Ans. Binary morphology was used as a principal method because binary 
morphology gives general routines for pattern matching and it is fast, 
memory efficient. Today, processors. are fast, memory is Cheap and 10 
year ago we can not be imaginéd but we arë using binary morphology for 
pattern matching. at, fast speed, inary. morphology is elated with sots 
operations. In a binary.image, for¢ground,pixels mean black peels fs ON 
or 1 and image background pixels mean white Pixels or off of б ай 
images, tuple or a 2-dimensional vector of the (x 


element of binary image. TT. ве 


АДАЛ 


а 
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in image processing ? 
Ans. The applications of morphological based operations are as follows — 
(i) To remove noise in the image 
(ii) To quantitative description of images 
(iii) To segment images from the background 
(iv) To enhance the image structure. 


Q.27. Give the features of morphological operations. 
Ans. There are several features of morphological operations as follows — 

(i) When the size of the structural element expands, then 
morphological operations remove information of a greater extent. 

(ii) Morphological operations uses a well-developed morphological 
algebra for representation and optimization. 

(iii) This is possible to describe digital algorithm in the form of a 
very small class of primitive morphological operations. 

(iv) White managing the stability of the important geometric 
characteristics, morphological operations give systematic alteration of the 
geometric content of an image. 

(v) Non invertibility of morphological operations characterize their 
linear transformations. 

(vi) Morphological operations use rigorous representation theorems, 


. which one may get the expression of morphological filters in form of the 


CY uA SS ne (x, y)-plane represents each . 


primitive morphological operations. 
Q.28. Explain some basic morphological operations. 


Ans. The basic operations of morphology are dilation, erosion, closing 
and opening. 

(i) Dilation — A dilation operation enlarges a region. A dilation adds 
pixels to the perimeter of each image object (sets their values to 1), filling in holes 
and broken areas, and connecting areas that are separated by spaces smaller than 
the size of the structuring element. The dilation (operator Ф ) is defined as – 

dilation (x) = x € s -Uses Xa 

(ii) Erosion — Erosion is an operator that basically removes objects 
smaller than the structuring element and removes perimeter pixels from the 
border of larger image objects (sets the pixel value to 0). If x is an image and 
s is the structuring element (mask), the erosion (operatorO) is defined as — 


erosion (x)= x9 s= Naes*-a 


where x, indicates a basic shift operation in the 
and x_, would indicate the reverse shift operation. 


direction of element ‘a’ of s 
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(iii) Opening — An opening operation (erosion then dilation) can —À S 


separate objects that are connected in a binary image. Opening generally According to this rule ero and dilation are duals with res t o th 
smoothes the contour of an object, breaks narrow isthmuses, and eliminates complement operation. ON. 5 with respect to the 


thin protrusions. Mathematically, the opening function can be described by (viii) Expansively — 
opening (x) = dilation (erosion (x)) X2 XOs (Erosion) 
or, using the operator e, XXxGOs (Dilation) 
xos = (xOs)O S : The output image of dilation operation i 
. LUE AAA А peration is expanded, so t ilation i 
(iv) Closing — The closing operation 15 defined as dilation followed expansive. But the output image of erosion ae ia Е а 
by an erosion using the same structuring element. A closing operation can erosion is anti-expansive. р саан 
close up internal holes and gaps in a region and eliminate bays along the boundary, (xi) Commutative — 
closing (х) = erosion (dilation (x)) x ®s=s@x (Erosion) 
or, using the operator », XOs-sOx (Dilation) 
хе = (xo s)Os \ E 
| ( "Я 2 zd) Q.31. What are the properties of open and close operations ? 
Q.29. What is the use of dilation and erosion ? : Ans. There are several properties of open operation as follows — 
Ans. Use of Dilation — It is used for bridging gaps. Simple structuring (i) x° s is a subset of x. 
element is used to repairing the gaps. The maximum length of the gaps is (ii) When P is a subset А 
; s of А then Р о 
called to Petwo prek 3 Q, then P os is a subset of Qos. 


E у 3 I ш) Open is increas ion. 
Use of Erosion — It is used for eliminating irrelevant detail from binary image. ЧОР е operation: Hence, 


Structuring element is helped to eliminate irrelevant detail from a binary image. i 
(iv) The open operation is anti-expansive because erosion is followed 


0.30. What is the properties of dilaton and erosion eos by dilation in open operation. Hence, 
Ans. There are several properties of dilation and erosion as follows — 
(i) Dilation and erosion are not inverses of each other. 
(ii) The dilation and erosion are translation invariant. — 


хоѕ<уоѕ 


xos<x 


(v) The opening of x equivalents to the closing of the complemented 


a А Е" : image x°. x o 5 < (x° « 5) 
If саа cx'Os Vs ad ay (vi) хоз=(хо$)о$ 
х@зсх'Ө®з Ye 2 s There are several properties of close operation as follows — 
If bcitsyOscroOs Yx EE (i) x is a subset of x » s. 

(iv) Distributivity — . ‘Ges чу ra AO PEOR | (ii) If P is a subset of Q, then P «s is a subset of О • s. 
x@(sUs')=(x@s)UK@s) ^ ^ "77 (iii) Close is increase operation. Hence, 
xO(sus)-(xOGs)n(xGs)' "e XesSyes 

(V) Iteration — "Ya MS (iv) The close operation is expansive because dilation is followed by 
(хӨѕ)Өѕ=х O(s@s') DD САС E erosion in close operation. Hence, 

(x ®s)@s'=x Ө (5 @5') t ҮЛ XSX°S 

(vi) Local Knowledge- ^" ^: е di Bi à (v) The closing of x equivalents to the opening of the complemented 
(x^z)8s-(xes)o (29s) n О E. Iu x. | 

(vii) Duality — с | x s s S (xo s) 


хсФѕ= (хӨзѕ) `` ·. EC LE (vi) X*s=(Xes)es 
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in how hit-or-miss transformation is 
jee " (R.GP.V.,, June 2015) 

Ans. A transfo 
hit-or-miss transformati 
searching structuring el 
transformation investigates th 
using two different structuring el 


rmation which is employed for template matching is known as 
on. It is a morphological operator which is used for 
ement size or local patterns of pixels. Hit-or-misg 
e inside and outside of images at the same time by 
ements. When the first structuring element 
translated to that pixel fits the image andthe second structuring translation element 
misses object, then a pixel belonging to an object is preserved by the hit-or-miss 
operation. Two template sets s and (w-s) is included by the hit-or-miss 
transformation. These two sets are disjoint. Image background is matched by 
template (w-s) and image foreground is matched by template 5. Intersection of 
the foreground erosion with s and the background erosion with (w-s) is the hit-or- 
miss transformation. Expression for the hit-or-miss transform can be written as — 


HM(x, 5) = (x @s)A[x° O(w-s)] 


Here, x = Input image 
s = Structuring element 
w = Small window which includes at least one pixel, thicker than s. 


[87] 
Ф 
Е 
" 


Structuring 
Element s 
r=sUyUz Input Image x Forground 
Foreground of the After Erosion 
Input Image x (x Os) 
(x) 


Background of the КЕ 
Input Image x Input Image x Background 
(=°) After Erosion ; 
Ix*- (x - s)] 


‘ Hit-or-miss Transformation 
of Input Image x 2 


Fig. 1.21 Process of Hit-or-miss Transformation 
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0.33. Explain morphological operation on binary image. 
Ans. There are two basic morphological operations as follows exu, 


| (i) Dilation—The process of expand the binary image from its original 
shape is known as dilation. dt is an expansion operation. It is also an expansion 
operator. It increases size of binary objects. Structuring element is used to 
determine the way of binary image expand. The size of structuring element is 
small as compare to original image, and usually the size which is seid for the 
structuring element is 3 x 3, The structuring element is reflected and shifted 
from top to bottom and from left to right at each shift, the process will look. For 
any overlapping similar pixels between the structuring element and that of binary 
image. When there is any overlapping found between pixels then pixels will be 
turned to black or 1 under the centre position of the Structuring element. 
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Gg 699a Element (s) (x 9 5) 


Fig. 1.22 Dilation 


Let us assumes as the structuring element and x as the reference image. 

The expression of dilation operation is as follows — 
xGs- (zil&, axle x} 

Here, $ = Images moved about the origin. 

The above, equation states that the outcome element z should be that there 
will be at least one element in s that intersects with an element in x when the 
structuring element dilates image x: When this is the condition, the place in which 
the structuring element is being centred on the image will be black or 1 or ON. 

(ii) Erosion — Erosion js the. process of decrease the binary image 
from its original shape. It is a thinning operation. It is also a thinning operator. 
It decreases size of binary objects, The structuring element is used to determine 
the way of binary image shrink. The size of structuring element is small as 
compare to original image, and usually the size which is used for the structuring 
element is 3 x 3. The erosion process can shift the element of structuring 
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tà Fig. 1.23 Erósion 


Let us assume s as the structuring element and x as the reference binary 
image. The expression of erosion operation is as follows — 


x O s= {z|(s} с x} | ie 


wo R 5 ER 


The above expression states that if the structuring element is a subset or 
equal to the binary image x then the output element z is taken only. : 
Q.34. Explain opening and closing operations on images by using 
suitable example. (R.GP.K,, June 2015) 
With necessary figures, explain the opening and closing operations ? 
(К.СРИ, Nov. 2018) 
Ans. Opening Operation — It is based on morphological operation 
such as erosion and dilation. This is erosion operation followed by a dilation. 
It is used to smooth the inside of the object contour, breaks Darrow strips 
and removes thin portions of the image and to remove noise and CCD 
defects in the images. The mathematical expression of Opening is written 


as — 


xos=(xOs)@s 

In words, the opening of x by s is simply the erosion of x bys, followed 
by the dilation of the result by s. Е i62 
where x represents input image and s represents structuring element ' 


4% 
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Image After Open Operation Image After Close Operation 
А id б 
Fig. 1.24 Open and Close Operations 


Closing Operation — It is opposite of the opening operation. This is 
dilation operation followed by an erosion. It is used to fill the small holes and 
gaps in a single pixels object. Closing also tends to smooth sections of contours. 
It manages the shapes and sizes of images. The mathematical expression of 
closing is written as — 

X*$— (xOs)Os 

In words, the closing of x by s is simply the dilation of x by s, followed 
by the erosion of the result by s. 

Where x represents input image and s represents structuring element. 


Original Image 


- 0.35. Explain morphological operations on gray-scale images. 

Ans. Dilation, erosion, opening and closing are basic operations of 
morphology. We use these operations to develop several basic gray-scale 
morphological algorithms. 

(i) Dilation — Gray-scale dilation of x by s, denoted x & g is defined 
as 
x ®s(m,n) = max{x(m—p,n—q)+s(p,q) 
Я | (т – р), (n- q) € Dx; (p, q) € D3} 
where, D, and Dg are the domains of x and s, respectively. 


The general effect of performing dilation ona gray scale image is twofold, 
first, if all the values of the structuring element are positive, the output image 


rr. 
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i d 
ter than the input. Second, 
oa on how their values and sha 


dark details either are reduced or 


tends to be bri pes relate to the structuring 


eliminated, dapena n 
ilation. 
= pé i i 5 — Gray-scale erosion, denoted хӨв is defined as — 
тө: (m, n)= min{x(m+p,n+q)- s(p,q) | (m + p), (n +q) 
| | €Dx;(p,q) €D,} 


i | tively. 
the domains of x and s respec | 
core = g erosion оп а gray-scale image is twofold — 


ffect of performin: y Е 
e is vbi of ш element are positive, the output image tends 


to be darker than the input image. Second, 
the effect of bright details in the input image 
that are smaller in area than the structuring 
element is reduced, with the degree of 
reduction being determined by the gray-level 
values surrounding the bright detail and by 
the shape and amplitude values of the 
structuring element itself. 

Gray-scale dilation and erosion are duals 
with respect to function complementation 
and reflection. That is 

(хӨ$)° (m, n)- (x° ®$)(m,n) 


where X^ =—x(p,q) and 8 =s(—p,-4). 
For example, take a simple-gray-scale - 
image as shown in fig. 1.25 (a). In fig. 125 
(b), the result of dilating this image with a 
‘flat top’ structuring element in the shape 
of a parallel epiped of unit height and size 
5 x 5 pixels. Dilation is expected to produce 
an image that is brighter than the original 
and in which small, dark details have been 
reduced or eliminated. In fig. 1.25 (с), the 
result of erosion is opposite effect to dilation. 
The eroded image is darker, and the size of 
small, bright features were reduced. 

(iii) Opening — The opening of 
image x by subimage s, denoted xes is 
defined as — 

xos = (хӨѕ)Ф S 

As in the binary case, opening is 
simply the erosion of x by s. Followed by 
a dilation of the result bys...) $i, neas МЭ 


„Өз + 
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The gray-scale opening operation satisfies the foll 

(a) (xos) 1x 

(b) If x, 4x, , then (хо) J (x208) 

(c) (xex)ex = xos 
The notation e-r is used to indicate that the domain ofe is a subset of the 

domain of r and also that e (p,q) « 1(p,q) for any (p, q) in the domain of e. 
(iv) Closing — The closing of x by s, denoted x.s is defined as — 
| xes = (xQ s)Os 

The closing operation satisfies the following properties — 

(а) xJ(x.s) 

` (b) If xj 4x9, then (x15) 2 (x 2 5). 

(iii) (x*s)ex = хех. 

The usefulness of these expressions is similar to that of their binary counterparts. 


The opening and closing for gray-scale images are duals with respect to 
complementation and reflection. That is 


owing properties — 


(x.s) = x^o$ 

Where x° = — x(p, q), above equation can be written as — (xes) = (-x 08). 

Opening and closing of images have a simple geometric interpretation. 
Suppose that we view an image function x(p, q) in 3-D perspective, with the 
p- and q axes being the usual spatial coordinates and the third axis being gray- 
level values. In this representation, the image appears as a discrete surface 
whose value of any point (p, q) is the value of x at those coordinates. Suppose 
that we open x by a spherical structuring element, s viewing this element as a 
“rolling ball”. Then the mechanics of opening x by s may be interpreted 
geometrically as the process of pushing the ball against the underside of the 


Fig. 1.26 Opening and Closing Gray-scale Image : id 
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face i rsed. The opening x*s then is 
uei bus part of the sy as it slides over the entire undersurface of x. 
Fig. 1.26 (a) shows a scan line of gray-scale image as a continuous function 
to simplify. And the rolling ball in various positions as shown in fig. 1.26 (b), 
Fig. 1.26 (c) shows the complete result of opening X by s along the scan line, 
And last fig. 1.26 (d) and (e) shows the result of closing x by s. 
0.36. Give some application of gray-scale morphology. 
Ans. Some various application of gray-scale morphology are as follows — 
(i) Morphological Gradient — Dilation and erosion often are used 
to compute the morphological gradient of an image, denoted g. 
g= (x8) - KOS) 
(ii) Textural Segmentation — A simple gray-scale image composed 
of two texture regions. The objective is to find the boundary between the two 


regions based on their textural content. 
(iii) Granulometry — tis a field that deals principally with determining 


the size distribution of particles in an image. 
(iv) Morphological Smoothing — One way to achieve smoothing is 
to perform a morphological opening followed by a closing. 

(v) Top-hat Transformation — This transformation which owes its 
original name to the use of a cylindrical or parallelepiped structuring element 
function with a flat top is useful for enhancing detail in the presence of shading. 

Q.37. Explain thinning operation of morphology. ; 
Ans. Thinning a binary image down to a unit-width skeleton is useful not 
only to simplify the computational methods but also to decrease the pixels 


amount, needed for shape description. The thinning operation is based on the 


hit-or-miss transformation as given below — 
XGB-X-HM(X, B) 


the surface of the highest points 


or X QB =X ^n НМ(Х, В) HN (I) 
Fig. 1.27 shows the different possibilities of structuring elements. 
Origin ы 


B Operatio" 
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The operation of thinning is divided into two types — (i) It deletes bound: 
pixels of a connected component which are neither important for inc 
the connectivity of image nor represent any significant geometrical Publi: 
an image. The operation converges if the connected skeleton does not change 
or vanishes even when the iteration operation continues. (ii) By a label ios 
the distance of the pixel to the boundary, thinning operation encodes ae 
information for every pixel of the pattern. The pixels set with local minim 
distance labels is used to derive the resulting skeleton. TP 


Q.38. Explain thickening operation of morphology. 


; Ans. Thickening is the morphological operation. It is used to grow selected 
regions of foreground pixels in binary images. This operation is somewhat such 
as dilation or closing. Following equation defines the thickening operation— 

a ХВ =X UHM (X, В) (i) 
In equation (i), X represents the input image and B represents the structurin g 
element. The thickened image comprises ofthe original image plus any additional 
foreground pixels switched on by hit or miss transform. The thickening operation 
is the dual of thinning, i.e., thinning the foreground is equivalent to thickening 
the background. This process is normally continued until it causes no further 
changes in the image. Fig. 1.28 shows the different structuring elements which 
can be used in the thickening operation. E 


Bs * B7 
Fig. 1.28 Representation of Different Structuring Elements of Thickening 
Operation 
The operation of thickening is computed by translating the origin of the 
structuring element to each possible pixel position in the image, and at each 
position comparing it with the underlying image pixels. When the black and 
white pixels in the structuring element exactly match the black and white 


pixels in the image, then the image pixel underneath the origin ofthe structuring 


element is set to black. Otherwise, it is remained unchanged means white. The 
applications of thickening operation are determining the convex hull and 
determining the skeleton by zone of influence. on “п E. 
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0.39. Explain about skeletons in morphology. 
.. Ans. As represents in fig. 1.29, the notion of 
a skeleton, S(A), ofa set A is intuitively easy. We 


deduce from this figure that — 
. (Gi) Whenz represents à point of S(A) 
and (D), is the largest disk centered at z and 


contained in A, one cannot find a larger disk - 
containing (Р), and involved in A. The disk (D); (a) Various Locations of 


is known as a maximum disk. Maximum Disks with 
(ii) The boundary of A at two or more Centers on the Skeleton of 4 
different locations is touched by the disk (D. c 
In terms of erosions and openings, the 
skeleton of A is defined. That is, it can be 
represented that 


K 

S(A)= Us (А) (i) Ф) Representation of set А 
=0 7 

with S,(A)=(A Ө KB)- (A © kB) ӨВ ...(ii) 

where B represents a structuring element, and 

(A © kB) represents k successive erosions of A, 

which are given below — 

(A Ө kB) =((....(A Ə B) Ə B) ©...) QB) Gi) (c) Representation of 


k times, and K is the last iterative step before A another Maximum Disk 
erodes to an empty set. In other words — on a Different Segment 


K = max(k(A Ө kB) # 4} (àv) of the Skeleton of A 

The formulation provided in equations (1) and 

(ii) represents that S(A) can be achieved as the 

union of the skeleton subsets S,(A). Also, it can 

be represented that A can be reconstructed from 
these subsets by using the equation — 


(d) Representation of 


K 
A= UG.) 9 kB) dv) Complete Skeleton 
k=0 Fig. 1.29 
where (SK(A) © kB) = ((...((S(A) € B) ® B) © .....) © B) (vi 
on M 


Q.40. Explain about pruning in morphology. —. 


Ans. Pruning procedures are an important compleme E nine and 
skeletonizing algorithms due to these methods tend to leave parasitic S Кол ant 


that require to be *cJeaned up” by postprocessing. In the automated recognitio! 


of hand-printed charact иы я 
BUE ton of each Ed ers, a general method is to analyze the shape of the 
Me T dutina erosi jets Spurs characterizes these skeletons. Spurs are 
rp ters ы adik y non uniformities in the strokes composing the 
Beginning with th ing this problem, a morphological method is developed. 
not exceed a spe " йаз be the length of a parasitic component does 
iion in fg P Hs ie odd of pixels. The skeleton of a hand printed “a” is 
MSN eteri m j (а). e parasitic component on the leftmost part of the 
агас! 181 ustrative of what we are interested in eliminating. By successivel 
removing end point of parasitic branch, the solution is based ee ae 
parasitic branch. Of course, this also shortens other branches in Mig и 
but, in the absence of other structural information, the assumpti ‘ pipes 
example is that any branch with three or less pixels is to be ee ш. 


of an input set A with a order of structuring elements designed to detect only 


Bl, B2, B3, B4 (Rotated 90°) 


ш 
CIT B5, вё B7, B (Rotated 90°) 


(0) 
BEAD ә. Fig. 1.30 Representation of Pruning’ 
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ts obtains the desired result. That is, let 
X, =A® {B} wth 
where {B} represents the structuring element order «ец is represents i 
figs. 1.30 (b) and (с). The order of structuring elements comprises of t 
different structures. each of which is rotated 90" for a total of eipht eleme 
The sign of x in fig. 1.30 (b) represents a “don t care" case, in the sense 
it does not matter whether the pixel in that position has a value of 0 or |. 
Various results reported in the literature on morphology are based on the use 
of a single structuring element, just like to the one in fig. 1.30 (b), but having 
“don't care" cases along the whole first column. This is not correct. For 
instance, this element would identify the point located in the eighth TOW, fourth 
column of fig. 1.30 (a) as an end point, hence, removing it and breaking 

connectivity in the stroke. 

Performing equation (i) on A three times gives the set X, in fig. 1.30 (d), 
The next step is to "restore" the character to its original form, but with the 


end poin 


parasitic branches eliminated. To do so first needs forming a set X» containing 
all end points in X, as given below — 
8 
Xj- Uc: @B*) (ii) 
k=l 


where BK represents the same end-point detectors as represented in figs. 1.30 
(b) and (c). The next step using set A as a delimiter, is dilation of the end points 
three times — 

X37 (;09H)^A (iil) 
where, H represents a 3 x 3 structuring element of 1s and the intersection 
with A is performed after each step. Like in the condition of region filling anc 
extraction of connected components, such conditional dilation prevents the 
creation of 1-valued elements outside the region of interest, like evidenced b) 
the result represented in fig. 1.30 (f). Finally, the desired result can be obtained 
by the union of X, and X5, 

Xq= X1 U X; .. (iv) 
in fig. 1.30 (g). 

In more complex scenarios, equation (iii) use sometimes picks up th: 
*tips" of some parasitic branches. This case can occur when the end points o 
these branches are near the skeleton. However, equation (i) may remove them. 
due to they are valid points in A, they can be picked up again during dilation 
Unless whole parasitic elements are picked up again, detecting and eliminating 
them is simple due to they are disconnected regions, At this juncture, а natural 
thought is that there must be simpler methods to solve this problem For 

„„instance, we could just keep track of all-deleted points and eg; } 


» ‘ 


E" 
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the appropriate points to all end points remain after application of equation. 
This option is not invalid, but the advantage of the formulation just shown is 
that the use of casy morphological constructs solved the whole problem. In 
practical conditions, if a sct of this type of tools is available, the advantage is 
that no new algorithm have to be written, The important morphological 
functions into a order of operations are easily combined. 

0.41. Explain the region growing technique. 

Ans. A technique which groups pixels or subregions into larger regions 
based on predefined criteria for growth is known as region growing technique. 
The basic technique is to begin with a set of “seed” points and from these 
grow regions by appending to each seed those neighbouring pixels that have 
predefined properties similar to the seed. Choosing a set of one or more 
beginning points can be based on the problem nature. If a priori information is 
not available, the technique is to compute at every pixel the same set of properties 
which ultimately will be used to assign pixels to regions during the growing 
process. When the result of these computations represents values clusters, 
the pixels whose properties place them near the centroid of these clusters may 
be used as seeds. 


The selection of similarity criteria depends not only on the problem under 
consideration, but also on the type of image data available. For example, the 
analysis of land-use satellite imagery depends heavily on the use of colour. To 
solve without the inherent information available in colour images, this problem 
would be significantly more difficult or even not possible. Region analysis 
must be carried out with a set of descriptors based on intensity levels and 
spatial properties, if the images are monochrome. When connectivity properties 
are not employed in the region-growing process, descriptors alone may yield 
misleading results. For example, a random arrangement of pixels is visualized 
with only three distinct intensity values. With the same intensity level, grouping 
pixels to form a “region” without paying attention to connectivity would yield 
a segmentation result which is meaningless. 


The formulation of a stopping rule is the other problem in region growing. 
If no more pixels satisfy the criteria, region growth should stop for inclusion 
in that region. Criteria like intensity values, texture, and color are local in 
nature and do not take into account the history of region growth. Additional 
criteria which increase the power of region-growing algorithm utilize the concept 
of size, likeness between a candidate pixel and the pixels grown so far, and the 
shape of the region being grown. The use of these types of descriptors is 


based on the assumption that a model of expected results is at least partially 
available, | " | 
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0.42. Write a basic region-growing algorithm based on 8-conn 

Ans. Assume that f(x, y) represents an input image array. S(x, y) re 
a seed array containing 15 at the locations of seed points and Os elsewhere , 
Q represents a predicate to be applied at each location (x, y). Assume that nd 
arrays f and S are of the same size. A basic region-growing algorithm based an 
8-connectivity is given below — ) оп 


(i) Start 
(ii) Search all connected components in S(x, y). 


€ctivir, 
Present, 


(iii) Erode each connected component to one pixel. Labe 
type of pixels found as 1. All other pixels in S are labeted 0. 


(iv) Form an image fo such that, at a pair of coordinates (x, y), let 


fo(x у)= 1 if the input image satisfies the given predicate, Q, at those Coordinates 
and otherwise, let fox, у) = 0. 


1 all this 


(v) Let g be an image formed by appending to each seed point in S 
all the 1-valued points in fg which are 8-connected to that seed point. 


(vi) Label each connected component in g with a different region 
label. This is the segmented image achieved by region growing. 
(vii) End. 


Q.43. What do you mean by zooming and shrinking of digital images ? 

22 (R.GP.V., Nov. 2019) 

Ans. Image zooming is an important process in image processing. Basically 
zooming require two steps — the creation of new pixels locations and the 
assignment of gray level to those new locations. Suppose that we have an 
image of size 500*500 pixels and we want to enlarge it 1.5 times to 750*750 
pixels. The spacing in the grid will be less than one pixel because we are fitting 
it over a small image. In order to perform gray level assignment for any point 
in the overlay, we lool for the closest pixels in the original image and assign its 


gray level to the new pixels in the grid. This method era level assi i 
called nearest neighbout interpolation. "e oo ment is 


Image shrinking is done in similar manner as d 
equivalent process of pixel replication is row and col 
we want to shrink an image by one-half; we delete every row and col 
How to present information effectively on small devices ? EA co umn. 
challenge for small-screen interface developers because viewin Is a Her 
screen is becoming more difficult in our daily lives, We must SRM 


escribed for zooming. The 
umn deletion, For example, 


ways to organize, show, and search data or results on small screen, О f 
! ‚ Опео 
ol ; 
S to enable them to 


the methods is to build zoom able users, and all need to 
control the zooming purposes. 


à ш) . d s Я "ED 
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ENTATION AND DESCRIPTION — 
HEMES, BOUNDARY DESCRIPTORS, 
REGION DESCRIPTORS 


Q.1. Write the various methods of representation. Discuss any one. 
Ans. There are following methods of representation as given below — 
(i) Boundary (border) following 
(ii) Chain codes 
(iit) Polygonal approximations using minimum perimeter polygons (MPP). 
(iv) Signature 
(v) Skeletons. 
Boundary (Border) Following — By introducing a boundary following 
algorithms whose output is an ordered sequence of points. We consider — 
() Weare working with binary images where background and object 
points are labeled 0 and 1. 
(ij) That images are padded with a border of Os to avoid the 
possibility of an object merging with the image border. 
The method is extended to multiple, disjoint regions by processing the 
regions individually. | 
Given a binary region R, the given boundary consists of the following steps E 
(i) Consider the initial point bo, be the uppermost, leftmost in the 
image which is labeled 1. Indicate by со the west neighbour of bo, Co is 
always a background point. Consider b, represent the first neighbour 
encountered whose value is 1 and c; is the point immediately proceeding bi in 
the sequence. Check the 8 neighbours of bo, beginning at co and proceeding in 
a clockwise direction. Store the locations of by and bi for use in step (v). 
(ii) Consider b = b, and c = c, as shown in fig. 2.1 vil M 
(iii) Consider the 8-neighbours of b, beginning at c an line x. 
in a clockwise direction, be represented by nj, n», ....., ng. Determine the first 


Ny labeled 1. 
(iv) Consider b = nj and c = пк. 
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(v) Repeat steps (iii) and (iv) until b = bo and the next boundary 
point computed is b}. 


Fig. 2.1 

Because ny is the first 1-valued point determined in the clockwise scan, c 
is always a background point in step (iv). Sometimes, this algorithm is also 
called the Moore boundary tracking algorithm after move. 

Q.2. Write short note on chain code representation method. 

Ans. The chain code 1 : 
representation method was 
introduced in 1961 by Freeman. 
This representation is used to 3 0 
represent a boundary by a 
connected sequence of straight-line 
segments of specified length and i E 
direction. Typically, chain code (а) 4-directional ` (b) é-directional 
representation is based on 4 or 8 Fig. 2.2 Chain Codes. 
connectivity of the segments. The i i 
each segment direction is encoded 
by using a numbering method as 
shown in fig. 2.2. A boundary code 
formed as a sequence of such 
directional numbers is referred to 
as a Freeman chain code. The » 
differential chain codes is shown (д) 4. directional 


tend cele tan BeBe MER 
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The chain code representation i i : 
Step (i) — Choose dede Rs ieee. EE 
С | rve. This point i 
its absolute coordinates in the image. Pro) ape а Ey 

Step (ii) — Every consecutive point is re i 

У М Presented by a chain code 
showing the transition need to go from the Present point to the next point on 
the curve. 

Step (iii) — If the next point is the initial point then store the lengths of the 
curves into the file. 

A variation of chain code is differential chain codes and differential chain 
code is denoted by К;. Each differential chain code K; is represented by the 
difference of the current chain code c; and the preceding code ci (i.e, K; = 
с; — с) There are two types of chain code — 

(1) 4-directional chain code 
(ii) 8-directional chain code. 

Drawbacks of Chain Code – Any small disturbances along the boundary 
due.to noise or imperfect segmentation cause change in the code that may not 
be related to the principal shape features of the boundary. 


Q.3. Describe the polygonal approximations using minimum perimeter 
polygons. 
Or 
Write short note on polynomial approximation. (R.GP.V., Nov. 2018) 


Ans. A closed curve is approximated as a 2D polygon in case of polygonal 
approximation. The approximation becomes exact if the number of segments 
of the polygon is equal to the number of points in the boundary for a closed 
boundary. Thus each pair of adjacent point defines a segment of the polygon. 
The objective of a polygonal approximation is to capture the quality of the 
shape in a given boundary using a fewest possible number of segments. This 
approximation gives a simple representation of the planar object boundary. 

Consider X = (xj, X2, ....., xy) is a set of points on the boundary of a 
planar object to be approximated using a polygon. lt is defined as a partitioning 
of the set into N mutually exclusive and collectively exhaustive subsets Àq, А, 
2, Ay Such that each of the subsets may be approximated using a linear 
sequence. Polygonal approximation is obtained by minimization of an objective 


function in the form. a 
J= Уа (х), хел; NO 
i=l | | 
where lj represents linear structure that approximates the Points i and d 
represents a measure of deviation, For image processing operations, 
approximation methods of modest complexity are well suited. One of the 
most powerful is representing a boundary by.a minimum-perimeter polygon:: 
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(MPP). It is explained in the following discussion — 
(i) Foundation (ii) MPP algorithm. 


(i) Foundation — An automatic appealing method for produci 
algorithm to calculate MPPs is to enclose a boundary [see fig. 2.4 (4)] by a 
of concatenated cells as illustrated in fig. 2.4 (b). Because it is permitte 
shrink, the rubber band will be constrained by the inner and outer walls o 
bounding region defined by the cells. The shape of а polygon of mini, 
perimeter is produced by this shrinking which circumscribes the region епс}, 
by the cell strip as shown in fig. 2.4 (c). In fig. 2.4 (c), all the vertices of 
MPP coincide with corners of either the inner and/or the outer walls, 


In, 
Seq 
the 


(a) 
Fig. 2.4 
(iii) MPP Algorithm — The set of cells enclosing a digital boundan 


is known as cellular complex. Consider the boundaries under consideration 
are not self intersecting, that leads to easily connected cellular complexes. 


Letting write (W) and black (B) denote convex and mirrored concave 
vertices, respectively. The following observations are — 


(a) The minimum perimeter polygon (MPP) bounded by a simply 
connected cellular complex, but it is not self intersecting. 


(b) Each mirrored concave vertex of the mi 
polygon is a black (B) vertex, but not every black 
a vertex of the minimum perimeter polygon. 


nimum perimete! 
(B) vertex of a boundary i: 


(c) Each and every convex vertex of the 
polygon is a white (W) vertex, but not every white (W) 
is a vertex of the minimum perimeter polygon. 

(d) All white (W) vertices are inside the 
vertices are outside the MPP. 

(e) The uppermost, leftmost vertex in 
contained in a cellular complex, but it is always a 
minimum perimeter polygon.' .'. 2 з", 


minimum perimete! 
vertex of a boundar) 


MPP and all black (B) 


а Sequence of vertice: 
White (W) vertex of th 


UL o o p 


se 
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Q.4. Explain how merging scheme is use 


d to solve the proble 
polygonal approximation. m 


Ans. Merging schemes depend on average 
to solve the problem of polygonal approximatio 
line fit of the points merged so far exceeds a preset threshold, this method is 
used to merge points along a boundary, If this situation takes rises the error 
is set to zero, line parameters are collected and rep à 
point are merged, after all point has been merged, the intersections of adjacent 
line segments from the polygon vertices, Disadvantage of this method is that 
vertices in the final approximation do not always equivalent to inflections like 
corners to corners, because until the error threshold is increased, a next line is 
not begin. 


error or other criteria is used 
n. Until the least square error 


eats the process, other 


Q.5. Explain in brief about splitting methods of polygonal approximation. 

Ans. Splitting methods are used to subdivide a segment into two sections 
until a particular criterion is satisfied. For instance, a requirement might be 
that the maximum perpendicular length from a boundary segment to the line 
joining its two end points should not more than a preset threshold. When it 
does, the point containing greatest length from the line becomes a vertex, 
hence subdividing the initial segment into two different subsegments. Advantage 
of this method is seeking prominent inflection points. The best initial points 
are the two farthest points in the boundary for the closed region boundary. 


p r 
r r 
s q x a 


Fig. 2.5 Splitting Method 


Q.6. Explain about signatures. 


Ans. A one-dimensional functional representation of a boundary is called 
a signature. There are multiple methods to generate signature. The first one is 
to plot the distance from the centroid to the boundary asa function of angle. 
It does not matter that how a signature is generated. The objective is to minimize 
the boundary representation to a one dimension function which is simpler y 
explain than the original two dimensional boundary function. Method is use 
to generate the signatures are invariant to translation, but. they do depend on 
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rotation and scaling. With respect to rotation, normalization тау be oby, Unit- И 57 
ing a task to choice the same starting point to generate tpa |. ain, | - i 
Ker vec ташлай ceti. TORTE tarp s Signat signature. This function is referred to as a histogram of tangent angle values. 
me starting point. The first one is to choice the starting point Choice i, The slope density function boundary sections which generating very quickly 
athe ыы [е сени, Thesesandl ato GDE Enom an a the Po, changing angles and reacts on strongly to the boundary sections with constant 
i : ei | i : : 
which is farthest from the centroid. Ben ay tangent angle because a histogram is a measure of values concentration. 


With respect to both axes, depend on the uniformity assumptions in к. Q.7. Discuss boundary segments. 
and which sampling is considered at equal intervals of 0, varies i i Ans. Converting a small part of a boundary into segments is very useful. 
shape output in varies in the equivalent signature values. To normalize fict) The boundary's complexity is reduced by decomposition and therefore, 
is to scale all functions so that they span the same range of values, Them i description process is simplified. If the boundary has one or more significant 
merit of this technique is simplicity. The disadvantage of this method is NES concavities which carry details of shape, this technique is attractive. For such 


of the whole function depends on maximum and minimum values т а case, the convex hull of the region enclosed by the boundary is very useful 
dependence may be a cause of significant error from object to object wh к for robust boundary decomposition. The convex hull H of an arbitrary set s is 
є the smallest convex set having s. The set difference H-s is known as the 


the shapes are noisy. A more rugged method is to divide each sam 
: ple throug deficiency D of the set 
the signature variance, let us assume t i 1 "EM" ne жаи 
"e am small which it generates ш ешр = Him equal t Fig. 2.7 represents a boundary segments by using this method. There are 
15. Variance uy two object appearing in fig. 2.7 (a). The first one represents sets and its 


provides a variable scaling factor which is a inversely proportional t ies | 
; anne ME O Varies i deficiency. The second t iti 
size and works just like done by automatic gain control. Whatever the беа» а р шоло Бе кре du 


employed, the objective is to eliminate dependency on size while Saving th the contour of s and marking the 


---+ r-- 
H 


-— 
1 


ray ae ids banaa e versus angle is not the only methoi transition points which is meet Ж E 
чыны ы E E E > ot = method for generate the signature is t into or out of a object of the s E i 
angle between a line tan ES he ж шы Point on the boundary, plot tt convex deficiency. The output 5 | 
Although the output сед differ if d P cay p uem xr soe pete аа Sees: Е. i [ 
fig. 2.6, would hold detail, about =ч nom the г(0) curves as illustrated i fig. 2.7 (b) in this case. In m 
янса, Бесапве the кош. iis ere pat shape characteristics. Fa principle, this method is not (a) Sets (b) Boundary after 
along the boundary equivalent to ie | be Constant causes straight line: dependent of boundary size and Partition 
of this technique is to employ the OE pee in the curve. A variatiot orientation. Fig. 2.7 Boundary Segments 
pas slope density function like : In real life, due to noise, variations in segmentation and digitization, digital 
1 boundaries can be irregular. These effects appear in the form of convex 
ys ow deficiencies which include small, meaningless components scattered randomly 
N 


throughout the boundary. A general method is to smooth a boundary prior to 

partitioning as compare to sort out these irregularities by postprocessing. There 

hes are several methods for this. The first one is traverse the boundary and change 
exl the coordinates of each pixel through the average coordinates of k of its 
neighbours along the boundary. This method works for small irregularities, 
but it is not easy to control and take more time to process. If value of k is 
small, it can be insufficient in some boundary segments while if value of k is 


large, it can provide extra smoothness. 
0 0.8. Write the algorithm to obtain the skeleton of the region. 
Ans. There are two algorithms to obtain the skeleton of the region as 
given below — 
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gi Unit- Il 59 
- : shape of a plane region can be p 
Thinning Algorithm — Structural s Га pli gion redu, | 
into a graph by obtaining the skeleton of'the region via a thinning algorithn " 0.9. ми do you mean by the term skeleton 2 (RGR, Now, 2019) 
representation " Ans. An important approach of representing the structural shape of a 
Thinning algorithm is as follows — plane Эп is р reduce it to a graph. This reduction iesu dr 
я Т > satisfied, then f] obtaining the skeleton of the region via a thinning algorithm. A skeleton vis 
(1) If the given below condition are satisfied, then ags a conto, called Skelectonization. ; : 
point P1 for deletion. i оп; 


(a) 2 5 NZ(PI) < 6 (b) ZO(P1) = 1 a subset of the original component. There are different 
(c) P2.P4.P6 = 0 — (d) P4.P6.P8 = 0 skeletonization methods ~ one category is based on distance t 
Here, NZ(PI) = P2 + P3 +... + P7 + P8 + P9 and NZ(PI) represents th, a specified subset of the tra 


on. The origin: 
number of non-zero neighbours of P1. component can be reconstructed from the distanc, казанны e. E 
is defined by thinning approaches; and the result of skeletonization using thi nning 
@ т: | E nu E algorithms should be a connected set of digital curves or arcs. Motivations for 
Ll Р | оч | m mn interest in skeletonization algorithms are the need to compute a reduced amount 
of data or to simplify the shape of an object in order to find features for 
LU LE BOB recognition algorithms and classifications. Additionally the transformation ofa 


component into an image show ing essential characteristics can eliminate local 
(b) NZ(PI) = 4 and 20(РІ) = 3 noise at the frontier. 


Fig. 2.8 ; һ А à ; 
Here P, value is either 1 or 0 and is the number of zero to non-zer Ru Plain some simple descriptors Munro нн 
itions in the ordered sequence P2, P3, P4, ..... Ans. The length of a boundary represents one of its simplest descriptors, 


(ii) Conditions (a) and (b) are not changed number of diagonal components Provides its exact length for a chain coded 


le, ZO(P1) = 3 and NZ(PI )»4 in é The number of pixels along a boundary provides a rough approximation of its 
fig. 2.8 (b). F - length. The number of vertical and horizontal components plus 4/2 times the 
while litions (c) and (d) are changed into (с) curve with unit spacing in both directions. 


iy. The diameter of a boundary B is specified as given below — 
(c) P2.P4.P8 = QO..." Original ani Diam(B) _ max{D(p,,p))] " 
(d) P2.P6.P8 = 0. Fig. 2.9 Thinning ij 


represents a distance measure and p; and p; are points on the boundary. 
Skeleton Algor ithms ~ The set of points whose distance from the neare: The value of the diameter and the orientation of a line segment connecting the 
"n : У maximum known as skeleton, — ; two extreme points which comprise the diameter are useful descriptors of a 

1 lart 


i recai | boundary, The minor axis of a boundary is described as the line perpendicular 
ан to the major axis, and of such length that a box passing via the outer four 

psu Points of intersection of the boundary with the two axes completely encloses 
э > 


(i) Transform of distance vot 
uy(m,n)-ug(m,n): min ; Ua, J); (i, ):А( { 


Alm, n; i, j) Кын, the boundary. The box just defined is known as the basic rectangle. The ratio 
0 ap йә Ж i Am n)4u m Of the major to the minor axis is known as the eccentricity of the boundary. 

here k = 1, 2, et нй 3 -- 55 This also is a useful descriptor. 
A(m, n; i, j) = Distance between (m, n) and (i,j) EN Curvature is defined as the rate of change of slope. Usually, achieving 


,  _Teliable measures of curvature at a point in a digital boundary is not simple 
прш because these boundaries tend to be locally “ragged”. Although, using the 
difference between the slopes of adjacent boundary segments as a descriptor 
b... +, Of curvature at the point of intersection of the segments sometimes proves 
- ^ Useful. Since the boundary is traversed in the clockwise direction, a vertex 


(iii) The skeleton is the sei of points — 
(бт, n) : шт, п) z ugli, j), A(m, 
(iv) End. ' IET 


k = Maximum thickness of the region, the гаперо 


0 
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ПГС t when the change j Unit - 11 61 
oint p is said to be part of a convex segmen ge in slope "" А | | | 
о otherwise, р is said to belong to a segment which is concn, Q.12. Explain Fourier descriptors in boundary descriptors. 
The curvature description at a point may be refined further by using limi: Ans. A K-point digital boundary in the iy 
the change of slope. These descriptors must be used with care because " xy-plane is represented in fig. 2.11. Beginning . 
interpretation depends on the length of the individual segments relati. toj, atanarbitrary point (xo, yp), coordinate pairs 2 ir s ч 
overall length of the boundary. _ (хо, уо), Gi yp. (х, Уд), ....., (Хк, y a) T iib к 
О.11. Explain in brief about shape numbers in boundary desc, ripon are тегей жзне аи Н ; ` ee. 
Ans. The shape number of boundary, based on the 4-directional eos in ne s E E + PED C 
i difference of smallest i coordinate y pressed in the form Seose 
on ea dada magnitude, Т УЕ and У(К) = yp. With this notati 
order n of a shape number is expressed as the number of digits in it theb Eu быа bere esente e» "YT i 
representation of shape number. In addition, n is even for a closed bounda Soe dinat E E es as the ves 
and value of n limits the number of possible different shapes. ' sequence of coordinates s(k) = [x(k), y(k)], 


Fig. 2.10 shows all the shapes of order 4, 6 and 8 along with their chain-cod 
representations, first differences, and corresponding shape numbers. By treatin 
the chain-code as a circular sequence, the first difference is calculated. Howere 
the first difference of a chain-code is not dependent of rotation, usually the code 
boundary depends on the orientation of the grid. One method to normalize the gri 
orientation is by aligning the chain-code grid with the sides of the basic rectangl 

Practically, for a desired shape order, the rectangle of order n is determine 
whose eccentricity best approximates that of the basic rectangle. This пе 
rectangle is used to establish the size of grid and procedure is used to achiev 
the chain code. The shape number follows from the first difference of tli 
code. However, the order of the resulting shape number equals n due to th 
way the grid spacing was chosen, boundaries with depressions comparable! 
this spacing sometimes yield shape numbers of order greater than n. In thi 
condition, a rectangle of order lower than n is specifi 


repeated until the resulting shape number is of order n. 
Order 4 : Order 6 


ed and procedurei 


Chain Code- 
Difference — 
Shape No. — 


Chain Code -< 0 
Difference- 3 
Shape No.- 0 


Fig. 2.10 Representation of d 


fork = 0, 1, 2, ....., К — 1. Each coordinate Fig- 2.11 A Digital Boundary 
pair may be treated as a complex number, and its Representation Like a 
hence!" Complex Sequence 


s(k) = x(k) + jy(k) 09) 

For k = 0, 1, 2, ....., К — 1. That is, the x-axis is treated as the real axis 

and the y-axis as the imaginary axis of a sequence of complex numbers. 

However, the interpretation of the sequence was recast, the nature of the 

boundary itself was not changed. This representation has one great advantage 
that it decreases a 2-D to a 1-D problem. 


The discrete Fourier transform of s(k) is given below — 


K-l 
а(и)= > qu Pes (ii) 
: k-0 
for u = 0, 1, 2, ....., K — 1. The complex coefficients a(u) are known as the 
Fourier descriptors of the boundary. The inverse Fourier transform of these 
coefficients restores s(k) which is given below — 


K-I К m 
s(k) “канен ій) 


fork=0, 1, 2, ....., K — 1. Assume that instead of all the Fourier coefficients, only 


| the first P coefficients are employed. This is equivalent to setting a(u) = 0 for u> 
Р. 1 in equation (iii). The following approximation to s(k) is given below — 


E j2xuk i 
(К) = F Dawe? pun 
u=0 


fork=0, 1, 2, ....., K — 1. However, to achieve each component of S(k) , only 


P terms are employed, k still ranges from 0 to K - 1. That is, a ns 
ОЁ points exists іп the approximate boundary, but not as many 


eee 
em E С pt 
ptem TIT . 
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employed in the reconstruction of each point. From the Fourier transform, th 
high-frequency components account for fine detail and global Shape i 
determined by low-frequency components. Hence, the smaller P become. 
the more detail which is lost on the boundary. A few Fourier descriptors cay 
be employed to capture the gross essence of a boundary. Due to thes, 


[^ 
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). Expressio k) =s(k — 
€ as given below — п §p(K) = s(k ko) 


PH., Sp = X(k — ko) + jy(k — kp) | 
which simply changes the Starting point of the sequence to k = ko from k= 0 


except for u = 0, which include the impulse б(и 
means redefining the sequenc 


coefficients keep shape information, this property is valuable. Hence, they cay The last entry in table 2.1 represents that a change in starti 
be used as the basis for differentiating between distinct boundary Shapes descriptors in a different Way, in the sense that the 


: 0.13. Explain in brief about statistical m 
Ans. Statistical moments describe t 


about the origin of the complex plane is accomplished by multiplying the poin points of the segment and ro tating the line ene krei img = 
by o9, Doing so to every point of s(k) rotates the whole Sequence about th coordinates of the points are Totated by the same angle, : 
origin. The rotated sequence is s(k) o9, whose Fourier descriptors are give, UHI. в 5 
below — сае ° 

K-1 mw Ku i] 

a(u) - У s(k)ePe mik ie 
k=0 . 
= a(u) o8 NO _ e 


P r 
for u = 0, 1, 2, .., K- 1. Hence, by a multiplicative constant term el! К е 
rotation simply affects all coefficients equally, (0) Representation as a 1-D Fi unchon 


Table 2.1 represents the Fourier descriptors for a boundary sequent so Fi ïg. 2.12 


s(k) which undergoes rotation, translation, scaling, and changes in startin .—. The amplitude of 8 сап be treated as.a discrete random variable v and 
point. The symbol Axy is described as Axy = Ax + jAy, hence, the notatio form an amplitude histogram p(vj), i 7 0, 1, 2, uu, AW T, where A represents 
Sk) = s(k) + Axy denotes redefining the Sequence as given below — the number of, discrete amplitude increments in which amplitude scale is divided. 
500) = [x(k) + Ax] + jly(k) + Ay] NO Then, keeping in mind that Р(у;) is an estimate of the probability of value vi 

Table 2.1 Some basic Properties of Fourier па occurring, the nÈ moment обу about its mean is given below —. 


(а) Boundary Segment 


А-1 
Transformation | Boundary ] Fourier Descriptor ү! Nur uv) = Yi + m)"p(vj) Visas (i) 
Identity s(k) | | a(u) ; і) 1 » 
Rottion | s)esQQeP |; a (a) =a(u)e® cos NN Yt pow ^o dii) 
Translation 5.00) - s(k) + Ay. Ai) a(u) +A, ву) буз, = ww fa TY 
Scaling ss(k)=as(k) as(u)= аа(и) The quantity m is recognized as the mean or average value of v and li is 
Starting point. | s,(k) -s(k-kg) ` ap (U) = a(uje-2zkou/K Tecognized as its variance. Normally, only the first few moments are needed 


to differentiate between signatures of clearly distinct shapes: 


` "An alternative method’ ізо normalize g(r) to unit area and treat it as a 


In other words, translation comprises of adding а constant ы 
ect on the descriptors histogram: In other words, g(r? isiréáted as the probability of value г occurring. 


to all coordinates in the boundary. Translation has Doe 
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In this condition, r is treated as the random variable and the moments 


K-1 ч 
ualr) = Уп - m" gn) NT 
i=0 m 
K-1 
where m= 2 gn) ivl insi 


In this notation, K is the number of points on the boundary and u(r) i 
directly related to the shape of g(r). For example, the spread of the uw. 
about the mean value of r is measured by the second moment u(r) and third 
moment u3(r) measures its symmetry with reference to the mean. 

Generally, what we have accomplished is to decrease the description task] 

to that of describing 1-D functions. However moments are by far the most! 
popular procedure, they are not the only descriptors employed for this purpose, | 
For instance, another procedure involves computing the 1-D discrete Fourier 
transform, achieving its spectrum and using the first q components of the 
spectrum to define g(r). The advantage of moments as compared to other 
method is that implementation of moments is straightforward. They also keep 
a physical interpretation of boundary shape. From fig. 2.12, the insensitivity 
of this method to rotation is clear. Size normalization, if desired, can be obtained 
by scaling the range of values of g and r. 


Q.14. Explain in brief about shape descriptors. 


Ans. Shape descriptors are a powerful tool used in wide spectrum of 
computer vision and image processing tasks like object matching, classification, 
recognition and identification. Many approaches have been developed. There 
are a number of generic shape descriptors that are capable of providing a high 
dimensionality feature vector that accurately describes specific shapes (for 
example, Fourier descriptors and moment invariants). Alternatively, other 
descriptors describe some single characteristic that is present over a variety off u 
shapes, like circularity, ellipticity, rectangularity, triangularity, rectilinearity. c 
complexity, mean curvature, symmetry, etc. Even for a single characteristic 0 
shapes there often exist many alternative measures which are sensitive to differen 
aspects of the shape. Very likely, the shape convexity 15 a shape property witl 
the largest number of different methods defined for its evaluation, The need foi 

alternative measures is caused by the fact that there is ` a e shape descripto! 
which is expected to perform efficiently in all possible applications, 
Generally speaking, there are two approaches to analy ze shapes ~ boundary | 
i the information from boundary points only) and area based 
tasa brun - 1 the shape points). It could be said that, in the past, more 
ones (wien Pe al i б to the area based methods. The area based methods} 
attention has been ae respect to noise). Although not mentioned Often, an 
are more robust (e.g. wit 


itional rea: 


hape points, ra msi Е : 
j thods are usually simpler to compute. For example, to estimate accurately 
е 


| 7 a ofa E : н : Р : 
the m shape, while the perimeter estimation is not a straightforward task. 


| Depend 


imi ge pro 


objects 5 . : : 
E. bustness is a very desirable property when we work with low quality data 
(e 5 ioisy images or low resolution images), but recently, due to progress in 


ige technology, high quality data can be provided, and the use of boundary 


0 
ho 
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son for a larger number of methods that are based on ‘interior’ 
ther than methods based on boundary points, is that area based 


iven shape, it is sufficient to enumerate the number of pixels 


o be used. | 
‘Another example would be geometric (area) moment invariants, these are 
and accurately computable from the corresponding object image, while 


“r boundary based analogues involve computation of path integrals, which 


simple to be estimated from discrete data, which are mainly used in 
cessing and computer vision applications. On the other side, the 
ү based methods are more suitable for a high precision computer 
and image processing tasks. They are able to cope much easier with 
ith partially extracted boundaries or with partially occluded objects. 


ape boundaries are represented by a significantly smaller number of pixels 


complete shapes are. Of course, there are methods which cannot be 


4g. Area of S 
C,,(S) = 


77: (Perimeter of S)? 

usly uses both boundary and interior information. This quantity indicates 
much a given shape differs from a perfect circular disc, which is 
Stood as the most compact shape. Accordingly, the highest possible 
ictness (equal to 1) is assigned to circular disc. Finally, there are methods 
е only information from specific points or specific boundary parts 


$ belonging to the convex hull of the shape considered). 


e, We focus on shape analysis techniques based on the use of a set of 
lected shape descriptors/measures. Generally speaking, a shape 
15 а quantity which relates to a particular shape characteristic, More 
certain shape measure D(S) (related to a certain shape descriptor) 
yon Planar shape S into a real number. In order to be applicable in 
Sification, recognition or identification task, any shape measure is 
PP Invariant with respect to similarity transformations (translation, 
A Also, shape measures are preferred to be given in a 
` ^n easiest way to achieve a normalized form is to apply a 
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(i) D(S) є [0,1] 
(ii) D(S) = 1 


Ans. A region area is described 


and area are so 
in which the 


the circularity ratio R, 
Зе ЗЫ 41A 


2 
“a Bs . Es -P 
where, A-represents the area of the 


dimensionless measure. Hence, com 


of the. intensity levels, the minimum 


* 


unaffected by any deformation, as lon 


when a topologicàl descriptor is des 


герїой, this property obviously will hot be affected by a str 


scaling transformation which would preserve that D(S 
interval [0, 1] [or even better through the interval (0, 1)] 
the set of bounded compact planar regions.  . 


Thus, common desirable properties of a given shape measu 


Q.15. Explain in brief aboutsome simple descriptors in regional descriptor 
perimeter of a region represents the length of its boundary. H 

ometimes employed as descriptors, they apply pri 
size of the regions of interest is invariant. A m 
these two descriptors is in measuring compactness of a 
(регітеќег)2/агеа. Aslightly different descriptor of compac 
ratio, described as the ratio of the area of a region to the 
the same perimeter. P2/4r is the area of a c 


length of its perimeter. For a circular region, the value of this 
for square" region, the value of this 


changes ; it is insensitive also to orientation, ignoring, 
errors which can be introduced in resizing and rotatin 
easy measures employed like region:descriptors inc] 
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if S satisfies a certain property (here called a 
for which, actually, the shape measure D(S) is designed. 
(ii) D(S) is invariant with respect to the similarit 
| (iv) For апу 5 > 0 there is a shape S such that 
the best possible-lower bound for D(S)). Pf 


as the number of pixels 


ircle with perimeter 


is obtained by the equation, which is given 


) varies i] 
While S var 


y transformar; 
DS) < 8 (eg, 


area of a circle havin 


CK 


Tough 
tes thro, 


re D(s) an 


yl descrip, 


Од 
0j 


in the region. Tj 
owever, perimet 
marily to conditio 
ote frequent use , 
region, described , 
tness is the circulari 


length P. Heng 
below — 


ОА 


region in question and Р represents th 


pactness is insensitive to 
of course, 


ude the me 


and maximum intensity Val 
number of pixels with values above and below the mean, = '' 


в as there is no tearing $ 


cribed Бу. the number of h 


measure is | an 
s measure is 1/4. Compactness 15! 


uniform scal 


Yo 2 


computationi 


ga digital región. Anothe 


an and media 
ues, and th 


ЕТ йе 


0.1 6. Explain in brief about topological descriptors in regio 
i "Ans." Topological properties are useful for.global descr 
in the image plane. Topology is the study of properties of 


nal descriptors 
iptions of. region 
a figure. which are 


nae É A : Or joining of tli 
figure. For example, a region with two holes is shown in fig, 


2313. Hence 
holes in th: 


interpretation in terms of the Euler 
number. A polygonal network is 


wit 
etching or rotation ` 
= чш ш 


жү УУ 


y M 


MR ten У . " " xe - Es 


Unit - 
| transformation. Usually, when the мд 
i 


region is turn or folded, then the 
number of holes will change. As 
stretching affects distance, 
topological properties do not depend 
on the notion of distance or any 
properties implicitly based on the 
concept of a distance measure, 


For region description, another 
topological property useful is the 
number of connected components. A Fig. 2.13 R 
region with. three connected compo- 
nents is represented in fig. 2.14. 


epresentation of 
With Two Holes 


а Region 


In a figure, connected compo- 
nents C and the number of holes H 
can be used. to describe the Euler 
number E — 


E -C-H...(1) 


. The Euler number is also a 
ical property. Fig. 2.15 
o inc gions, fo ae Fig. 2.14 Representation of a Region 
haye Euler numbers equal to 0 and— with Three Connected Components 
1, respectively, due to the “A” has 
опе connected component and one 
hole and the *B" one connected 
component but two holes. Regions 
shown by straight-line segments 
(known as polygonal networks), 
include a particularly easy 


Fig. 2.15 Representation of Regions 
with Euler Numbers Equal to 0 and -1 


represented by fig. 2.16. Classifying 
interior regions of this type of 
network into faces and holes is often 
important. Representating the 
number of vertices by "V", the | 
number of edges by “Q”, and the dU 
umber of aun by F gives the Fig, 2.16 A Polygonal Network is 
following relationship, known а5 Contained by Region 


KE 
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Euler formula which is given below — 
V-Q+F=C-H 
which, in view of equation (i), is equal to the Euler number — 
V-Q+F=C-H=E 
The network in fig. 2.16 has one connecte 
vertices, eleven edges, two faces. Hence, the Eu 
7-11+2=1-3=-2 
An additional feature which is use 
is given by topological descriptors. 


0.17. Write short note on texture in regional descriptors, 
Ans. An important meth 
However no formal definition 


d region, three 


holes 
s » Se 
ler number is -2: м, 


ful in characterizing regions їп as 
Cen 
i 


ч 
& 
E 
i 
& 
8 
E 

E: 
à 
o 
Ө, 
d 
© 
Е. 8 
3 
8 
B 


0.18. Derive the expression 
Ans. The 2-D mom 
M Nis given below — M-I N-I 


m= У Урук y) 


where, p = 0, 1, 2, ..... and q — 0, 1, 2, ..... are inte 


! and rotation. 


of moment invariants in regional descriptor, Colour E The three nee. ues 
Mud oup of three correspon, 
ent of order (p + q) of a digital image f(x, y) of si; E Е URS 


, of the three RGB compo i 
5 (i, in the form of a 3-D col 
x=0 y=0 


From the second and third Moments "epe 
aM we s i » à Set of Seven invariant moments 
bi 7 nj + 7102 

$27 (my - No2)? + 4nj, P 
$3 = (no – 30? + n - 103)? pe 
s = (пз + Nha)? + (2, + поз)? Ko 
N li «(1X 
5 (то ee (139 + 1) [so + 1:2? ~ 3(n2, 
Ns" + G3, — 103) M21 + ту, DG + ni 
_ = (ту + 032] (x) 

$6 = (ny – 7102) [(n39 + n5? ~ (то + 103)2] 

+ 4 (тз + 


am 2 = (Ma1 + n3] „(xii 
This set of moments is inyar; | 21 * No3)"] ..(xii) 


» Scale change, mirroring 


Ans. Assume that we are give 


> 
as a vector. 
For example, let aj, 


umn vector, a where 


| а 
gers. The Correspondin i 
central moment of order (p + q) is given below — a=] a> 
M-I N-1 tm Ё AS 
"s . Hpq = >; Xe -X)'(y-yy f(x, y) . (i| ^ Here, this vector Tepresents one common pixel in all three images. If the 
CN - x=0 y=0 АЦ, images are of size М х N, there will be a total of K = MN 3D vectors after all 
for p = 0, 1, 2, ....- and 9=0, 1, 2, 0... ° tuus , the pixels are represented in this manner. If we have n registered images, the 
m _ ma" 5p vectors will be n-dimensional ` 
where x = Land y=—OL a П 
moo mop a Кш ia ay 
The normalized central ye denoted Пра, are given dd » Bm н 
Pq hy ав | 
"pq = —— T . 
Hoo jf au o än 
p+q ; : 
where T5 9.3 +1 , 


for р+9= 2,3, ah 


The assumption IS > 9 ylumn vectors (1.е. matrices of 
5 AA i d à yl E 
] 79 fool orde 


Tn X 1). We can write them оп a line of text simply by expressing them as 
5 = (ai, 85, ..., а), where “T” is transpose. 


ith CamScanner 
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n intensity histogram. The only differen 


constructing à 
ties like the mean and variance of the 


talking about quanti 
now talk about mean vectors an 


The mean vector of the population is defined as — 


М, = E(a) 


Qa = E((a - М„)(а - М„)Т} 


vectors їп the population, and element qj of Q, is the 


when n = 1. 


Q.20. What do you mean by thresholding ? 


intensity values of the backgrounds pi 
i i pixels are outsi 
image can be obtained using a thresholding RU: 


thresholding to be effective in object-background sep, 


We can treat the vectors as random quantities, just li 


d convariance matrices of the 


Here, Ef.) is the expected value ofthe argument, and the subscr: 
that M is associated with the population of a vectors. Recall d | 
value of a vector or matrix is obtained by taking the expected y 
element. The covariance matrix of the vector population is de ш of, 

as " 


okie a; and a; of these vectors. The matrix Q, is real and symmet 
ы A . . . e 

el = а and aj are uncorrelated, their covariance is zero and, (lense : ay come from several 

=qj=0a these definitions reduce to their familiar one dimensional a, "t AM 

тру 


Ans. A binary image is obtained usi 
: à t using an appropriat i 
gray-scale image. If the intensity values of an Fes A ^i bi idem 


that interval to 1 and points outsid 

segmentation and ае = iH id [if binary уво "af k 

designed to perform this thresholding operation in i д Шетаз have bs n 

such a camera is a binary image. In most applications noci ре out 

a gray-scale image and the binary image is obtained usn rd à 
resholding. 


Thresholding is a method to convert a gray-scale 


image so that objects of interest are separated fr 
f om the back Ў s 

ground. К EE 

Ans, Global Thresholding ‘Techniqu 
‘of all possible segmentation tectiniques. 1 


that the objects and background have sufficient contra 


К ke We 
Ce is that 
Tandom " 
Tan 


diq " 


ari 
“ble scheme, these int 


Ё Unit - I 71 


» ing, А s 1 i 
B the intensity levels of cither the objects or the background. In a fixed thresholdi 
. ing 


ensity characteristics determine the value of the threshold 


dom 
V i А 
ч, Let us assume that a binary image B[x, у] is the same as a thresholded 


gray image От[х, y] which is obtained using a threshold T for the original 


_ gray image G[x. 


е EX peg, 


\ 


Here, a isn dimensional, Qa and (a — М„)(а — M4)" are matrices “tt we may use 
n x n. Element qj of Q, is the variance of aj, the ith component i : 
or d 


ü 


t { 


сопуапапсе beta 
i 


where X is a S 


- Ans. The 
(i): 
0) 


BEC) 


— the first one is Су which holds all pixels w 
ne is G, which holds pixels with values < T. 


| second o 
(iv) 


this interval, a bint 


that sets the points! .Q) 


n 


; cameras gi 
ir (vi 


image into a bind 


aration, it is necessi 


ast and that we kno used 


0.21. Write the algorithm to estimate the thresh 


level values py and p. 


^.^. (vi) Until the differenc 
, a predefined parameter То, repea 


0.22. Explain global and ad 
a Керә 


у]. Тһиз 
В[х, y] = Gr [x, y] 


Pd, 
dey where for a darker object on a lighter background 


1 ifGp;y]s T 


От [х, y] = 
ris yl 0 otherwise 


If it is known that the object intensity values are in a range [Т\, T7], then 


| ifT <G[x,y]<T, 


Gr fi, Л= 4. 
© otherwise 


A general thresholding scheme in which the intensity levels for an object 


disjoint intervals may be represented as 
1 if Gix,yleX 
0 otherwise 

et of intensity values for object components. 


olding values. 
(R.GP.V., June 2015) 


alues is as follows — 


Gr [i Л = 


‘algorithm to estimate the thresholding v 


Start - m 
Seléct an initial estimate for T(threshold). 


Segment the image using T. This will generate two sets of pixels 
ith gray level values > T and the 


For the pixels in regions G, and Gp, calculate the average gray 


Calculate а new threshold value — 
bythe 
де". 
e in T in successiv 
t steps from (iii) to (V). 


e iterations is smaller than 


i) End. - „чөйрө: лэч 
üptive thresholding techniques. 


(R.GP.V., June 2015) 


e — It is simplest and most widely 
n this method, a threshold value 


mmm mu (ON 
TS * 


Scanned with CamScanner 


72 Image Processing and Computer Vision 


of 0 is selected and given condition is imposed — 
1 ifx(k, 020 
x(k, 0 = 


0 else "i 


Equation (i) represents à full description of a binarisation algorithm 
does not define that how to choose the threshold parameter 0. The value of 
has to be chosen in an optimal way. If pixels from different segments over] 
in their use of intensities global thresholding will be affected. A method lik 
minimum-error may estimate the underlying cluster parameters and select th 
thresholds to minimize the classification error when error is due to nois, 
Variable thresholding can be used when the overlap is due to variation į 
illumination across the image. Hence, this may be considered as a form џ 


local segmentation. 
In this method, using image intensity value, a gray scale image į 


transformed into a binary image. All pixels values are greater than the glob; 
threshold values are indicated by one and remaining are indicated by zero. 


ay 


Adaptive Thresholding Technique — In this method, the thresholdin, 
operations are based on local image features. In the condition of роо 
illuminated images, local thresholding is more useful as compared to glob: 
thresholding. 


A perfectly segmentable histogram can be converted into a histogra 
which cannot be partitioned effectively through a single global threshold bya 
uneven illumination. This problem can be solved by dividing the original imag 
into subimages. After that, to segment each subimage utilize a differer 
threshold. Two problems associated with this approach are as follows — th 
first one is how to subdivide the image and the second one is how to estima: 
the threshold for each resulting subimage. Because the threshold used fo 
each pixel depends on the pixel position with respect to subimages, this ty? 
of thresholding is known as adaptive thresholding. 


Input Global Thresholding Results 


i j . Global Threshold | Ц | 


Лођа Threshold 2 


Local Thresholding Result 


= Unit- i 73 
Q.23. What is image segmentation ? Explain, 


Ans. Image segmentation subdivides an image into its constituent object 
s. 


The level to which the subdivision is carried depends on the problem to be Tx B i ый 


solved, ic. рои should stop if the objects of interest in an application 7 
have been isolated. For instance, in the automated analysis of electroni s | 
assemblies interest lies in analyzing products’ images to find the ine yis - ^ 
absence of specific anomalies like broken connection paths or dose ee 
components. There is no need to carry segmentation past the level куне. 
required to identify those elements, Segmentation of nontrivial images is very ; 
difficult task. The eventual success or failure of computerized analysis methods к 
is determined by accuracy of segmentation. Considerable care is therefore 
required to improve the probability of rugged segmentation. Image segmentation 
is used to extract multiple features of the image that may split or merged in 
manner to develop objects of interest. Analysis and interpretation can be 
performed these objects. Р 
The goal of image segmentation is to divide an image into several parts/ m Ü 
segments having similar features or attributes. The basic applications of image | 
segmentation аге — Content-based image retrieval, medical imaging, object о: 
detection and recognition tasks, automatic traffic control systems and video E 
surveillance, etc. The image segmentation can be classified into two basic 
types — Local segmentation (concerned with specific part or region of image) 
and global segmentation (concerned with segmenting the whole image, 
consisting of large number of pixels). The image segmentation approaches 
can be categorized into two types based on properties of image. |. 
| (i) Discontinuity Detection Based Approach – This is the approach 
in which an image is segmented into regions based on discontinuity. The edge 
detection based segmentation falls in this category in which edges formed due E 
to intensity discontinuity are detected and linked to form boundaries of regions. 
(ii) Similarity Detection Based Approach — This is the approach in 
Which an image is segmented into regions based on similarity. The techniques 
that falls under this approach are — thresholding techniques, region going 
techniques and region splitting and merging. These all divide the image into 
regions having similar set of pixels. The clustering techniques also use this 
methodology. These divide the image into set of clusters having similar features 
based on some predefined criteria. 


In other words, also we can say that image segmentation can be 
approached from three perspectives - Region approach, edge approach and at 
data clustering. The region approach falls under similarity detection and edge 
detection and boundary detection falls under discontinuity detection. Clustering 


techni cba MPO 
Fig. 2.17 Representation of Global and Local (Adaptive) I Lus | is Iques are also under similarity detection 
ng 


174% 
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0.24. Explan in detail the classification of image segmentation tec, 


Ani : l Unit -|| 75 ed = 
x . ч qu, > Ж А ри | 
ed for image segmentation are — th \ ‘The values of thresholds can be computed with th Fa 
Ans. The popular techniques us c : І енн І POTE 
method, edge detection based techniques, region based techniques, clus image histograms. Simple algorithms can also be generated to i ad 
based techniques, watershed based techniques, partial differential equation bi Y i (ii) Edge Based Segmentation Methods . Я 
and artificial neural network based techniques etc. These all techniques ү ] 


; ‚ f These techni 
are differ well developed techniques of image Processing on thei chniques are 


from each other with respect to the method used by these for Segmentation, segmentation methods are based on the rap; T own. The edge based 


Image Segmentation Techniques 


Clustering 
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Fig. 2.18 Image Segmentation Techniques 


©  Thresholding Methods — These are the simplest methods fü T (iii) egion 2 Segmentation Methods = 
image segmentation. These methods divide the image pixels with respect ү segments 10 "s нейна = having similar cha га 
their intensity level. These methods are used over images having lighter objec, 216 fyo basic techniques based on this method – 
than background. The selection of these methods can be manual or automat; 


image features. Ther 
are basically three types of thresholding — 


These methods 
racteristics. There 


cow 


Tegion growing based s 

segmentation methods are the methods that segments the image into various i 

regions based on the growing of seeds (initial pixels). These seeds can be 5 

(a) Global Thresholding — This is done by using an selected manually (based on prior knowledge) or automatically (based on . | 
ill be constant for who; Particular application). Then the growing of seeds is controlled by connectivity 

» Y) сап be obtained fro; between pixels and with the help of the prior knowledge of problem, this can 


| bestopped. Let p(x, y) be the original image that is to be segmented and s(x, y) 
als = 1, if p(x, y)>T _ isthe binary image where the seeds are located. Also let, “Т” be any predicate 
t : 0, if p(x, y)<T | Which is to be tested for each (x, y) location. Then basic algorithm (based on НИ 
(b) Variable Thresholding — In this type of thresholding, th 8-connectivity) steps for region growing method are — i 
value of T can vary over the image. This can further be of two types — mes (1) First of all, all the connected components of ‘s’ are 
(1) Local Threshold — 1 } 


In this case the value d 
upon the neighbourhood of x and y. of T depend 


(2) Adaptive Threshold (2) Compute a binary image Рт, where Рт(х, y) = 1, if 


The value of T is а ТРИИ 
(c) Multiple Thresholding – In this type of теру «4. L | y ima 
are multiple threshold values like TO and Т1. By using these крл s L and (x, y) is 8-connected to seed in 3 | 
таве c: _ These connected components in ‘q’ are segmented regions. 


be computed as — m, if p(x, y)» TI go i 
q(x, у) = 4n, if p(x,y)<TI Hier -.' . (b) Region Splitting and Merging Methods - The region 
- г o, if p(x, y)< TO : E and merging based segmentation methods use two basic techniques 
litting and merging for segmenting an image into various regions. Splitting 


x and y. ; 
(3) Compute a binary image ‘q’, where q(x, y) = 1, if Pix, y) 


» 
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iterati ividi i into regions having similar cha 
iteratively dividing an image Іп ving | 
= TM to combining the adjacent similar regions, Fo 
ене буя the division based on quad tree. The basic algorithm S 
region splitting and merging are — | 
(1) First of all the Ry is equal to P. 
(2) Each region is divided into quadrants for wh 


teps 


False. | 
(3) If for every region, Т (Rj) = True, then Merge a 
regions R; and Rj such that T(R; UR j) = True. 


(4) Repeat step 3 until merging is impossible. 
where ‘p’ be the original image and ‘T’ be the particular predicate, 


(a) ! (b) 


Fig. 2.19 Division of Regions Based on Quad.Tree 
(iv) Clustering Based Se 


gmentation Method — The clustering base 
techniques are the techniques, w 


pixels with.similar characteristic 
the data elements into clusters s 
F similar to each other than others. 


Н 


s. Data clustering is the method that divide 


er are moi 
of clusterin 


Wely to minimiz 
are variou 


that divides the image into set of clusters su 
to only one cluster. In other words it can be sai 
exactly one cluster. These methods use membership functi 
either 1 or 0 i.e., one either certain Pixel can belong to Particular ¢ 


àving value 
An example of a hard clustering based technique is one k-mea 


luster or no! 
ns clusteri 


Tacter ; 


ich T(R) 


djacy 


| hole in its minima from where the water spil 


| the gradient of image as topographic surface 
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equality. 


(b) Soft Clustering — This is more natura} t 
because in real life exact division is not possible due to the 
Thus soft clustering techniques are most use i 

which division is not strict. The example of su 
means clustering. In this technique pixels are partiti 
on partial membership .€., one pixel can belon 
this degree of belonging is described by membershi 
more flexible than other techniques, 


ype of clustering 
Presence of noise, 


©) Watershed Based Methods — 


These methods uses the concept 
of topological interpretation. In this the inte 


nsity represents the basins having 
Is. When water reaches the border 
together. To maintain Separation 
borders of region of segmentation. 
The watershed methods consider 


- The pixels having more gradient 
are represented as boundaries which are continuous, 


of basin the adjacent basins are merged 
between basins dams are required and are the 
These dams are constructed using dilation. 


(vi) Partial Differential Equation Based Segmentation Method — 
The partial differential equation based methods are the fast methods of 
segmentation. These are appropriate for time critical applications. There are 
basic two PDE methods — non-linear isotropic diffusion filter (used to enhance 
the edges) and convex non-quadratic variation restoration (used to remove 
noise). The results of the PDE method is blurred edges and boundaries that 
can be shifted by using close operators. The fourth order PDE method is used 


to reduce the noise from image and the second other PDE method is used to 
better detect the edges and boundaries. 


i . (vii) Artificial Neural Network Based Segmentation Method — The 
ANN based segmentation methods simulate the leaming strategies of human 
brain for the purpose of decision making. At present this method is mostly 
used for the Segmentation of medical images. It is used to separate the required 
image from background. A neural network is made of large number of 
Connected nodes and each connection has a particular weight. This method is 
independent of PDE. In this method the problem is converted to issues which 
pU solved using neural network. This method has basic two steps — extracting 
"e $ and segmentation by neural network. 
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1 0.25; Give comparison of various’ entation technig | . Unit - 1 79 
р n of ae prb Y тя c na |! 4f we consider 4-connectedness for both foreground and 
Ans. The comparison of various segmentation techniques is given in tay there are four objects that are 1 pixel in size and areira m background, 
: : . € hole, 
| connectedness, then there is one object and no hole. Intuitively fe 8- 
! ? cases 


Table 2.2 Comparison of Various Segmentation Technique; we have an ambiguous situation. A similar ambiguous situati 


simple case like — 
[1] 0] 
ofi 


where if the 1s are connected, then the Os should not be. 
To avoid this situation, different connectedness should be used for objects 
' and background. If we use 8-connectedness for A, then 4-connectedness 


^ should be used for A . An example ofa simple binary image with its boundary, 
% interior and surrounding is shown in fig. 2.20. | 


`|Ехрепзїуе method i 
terms of time and m 


2.2. 


On arises in a 


spatial details are m 
considered 


(ii) |Edge based 


between objects. 
This methods based | Моге immune to 

on partitioning image | noise, useful when it 
into homogeneous ге | is easy to define - - 
_|gions. 17.5 [similarity criteria, 
This method is based on | Fuzzy uses partial 
division into homo- | membership there- 


(iii) Region based 


Determining membe . 
ship function is ny 


(iv) |Clustering 
method 


geneous clusters. fore more useful for easy. a 
| teal problems. = | - 

(v) [Watershed ` [This method is based | Results are more stable, | Complex calculation: 
on topological interpre- | detected boundaries gradients. ш 


tation. are continuous. 
This method is based | Fastest method, best |More computation! | 


(vi) |PDF based 


method on the working of diff- | for time critical appli-|complexity. | E (a) Original Image (b) Wl Boundary Pixels 
А cations. | | | E] Interior Pixels 
(eH) ANN based [Men unge om [Г] Surrounding Pixels 
In training. | ә 


| Fi ig. 2.20 A Binary Image with its Boundary, Interior, and Surroundings 

—.. The boundary of A is the set of pixels of A that have 4-neighbours in А. 

The boundary is usually denoted by A'. The interior is the set of pixels of A 

©] that are not in its boundary. The interior of A is (A – A’). Region T surrounds 
region A (or A is inside T), if any 4-path from any point of A to the border of 

е picture must intersect T. 


na pemen of A) that he iain 2.2 7. Explain the term connected components labeling. 
Я KP x 2 i binary image B is a labeled 
à 4X0 HN сыс. snow below image u in ipaum of pus micis label of its connected 
| Component”, | | 
|... One of the most common operations in machine vision is finding the 
Connected components in an image. The points in a connected component form 
... & Candidate region for representing an object. As mentioned earlier, in computer 


Q.26. Discuss about the connected components. 


‚ Ans, A set of pixels in which each pixel is connected to all other pixels! 
called a connected component. 


The set of all connected components of A 
points on the border ofan image is called the Бас 
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vision most objects have surfaces. Points belonging to a Surfa , 
, : з nofi сүү; Moen?) 4 ce р. Unit -Il 81 

spatially close points. The notion of "spatially close" is cg : Dro; — : | 

p У " indi vitat treaties. However the y an capture by c, a „omponents becomes one of finding a pixel whose value is — 1 in LI, assigni | 
components in c s д iss "^ | er, the connected Component a) F a new label, and calling procedure search to find its nei hbe oie ины 
usually form a bott € in a binary vision system. The algorithm iss ҮТ _ | and recursively repeat the process for these M have 
; Я ause the arg i H arte d Se й ; tes nl ÉD rS. ili : 
in nature, because the operation of finding connected components i. a | function neighbours (L, N) is given a pixel position defined т. “ты : 


operation. If there is only one object in an image, then there MAY not be hy returns the set of pixel positions of all of its neighbours, using either the 4 i я 
а neighbourhood or 8-neighbourhood definition. Only neighbours that represent : МИ 7 


e me р ^ . : е; та оБјес, he bi 
erties and location ih, itions on the binary image are : 
i s need to be found, then the Conte legal рит a. mia E fis t^ id The neighbours are returned 


components must be determined. ve - 
tin scan 

| А component labeling algorithm finds all connected com 
image and assigns a unique label to all points in the same compo ipe 
(a) and (b) shows an image and its labeled connected Loc не 
applications, it is desirable to compute characteristics such as m in mi 


bounding rectangle of th А > Positig 
components. Эш IE lebeling the 


ts iy, 
Fig ài 


(a) Four-neighbourhood (b) Eight-neighbourhood 
Fig. 2.22 Scan-line Order for Returning the Neighbours of a Pixel 
Algorithm for Recursive Connected Components – Compute the 


- connected components of a binary image. Here, B is the original binary image 
| _ and LI will be the labeled connected component image. 
HH procedure recursive connected components (B, Ll); 
{ $ 
LI := negate(B); | 
label := 0; \ 
| find components(LI, label); 
(a) An Image (b) Co Е 
nnected Component Ima i : 
— Fig. 2.21 a n | 
“40, Explain the recursive connected 
Ans. The f component algorithm. ; 
атн аге а number of different algorithms toe the ted "оаа 
can fit in ШЕП ol ў н algorithms assume that the BUS imas 
/ ploy a simple, recursive algor; M = Row l 
—€— at н а but can move all over the image a sana on om for > to ie Е m 
uppose that B is a binary image with M 0, | or N := 0 to Max 
; üx | 
+ dam We wish to find the connected compe i rows and MaxCu if LI[L, N] == - 1 then 
ави : iris output image LI in which every pixel E ше l-pixels and ( 
5 ie e component. The strategy, adapted from tid he labd кылаш 
ext, is to first negate the binary image, so that all the 1.5; the Tanimoto A! abel := label + 1; 
This is needed to distinguish unprocessed pixels "Pixels become —1% search(LI, label, L, N); 
component label 1. This can be done using a functi Г D from those oi } 
inputs the binary image B and outputs the negated н es negate thal 
become the labeled image. Then the. process of indi LI, which will ) 


ng the Connected. procedure search(LI. label, L, N); 
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{ 
LI[L, N] := label; 
Nset :=neighbours(L, N); 
for each (L', №) in Nset 
( 
ifLIL, N] = = – 
then search(LI, label, L', №); 
А É 
) 
The first five steps of the recursive labeling algorithm а 


component of the binary image of fig. 2.23. The im PPlied to the fiy 
& » 4.23. 1l a h : US 
labeled image LI. The boldface pixel of the i See (Partially 


by the search procedure. Using the neighbow i S 
222, the first unprocessed neighbour of the 


is selected at each step as the next pixel to be processed. 


um А 


ла ty ae 


(a) Binary Image (b) Connected Components Labeling 


(с) Binary Image and Labeling, Expanded for Viewing 
Fig. 2.23 A Binary Image with Five Connected Components of the Value 1 


0.29. Write short note on row-by-row labeling algorithm. 


Ans. The classical algorithm, deemed so because it is based on the classical 
connected components algorithm for graphs, was described in Rosenfeld and 
Pfaltz in 1966. The algorithm makes two passes over the image — one pass to 
record equivalences and assign temporary labels and the second to replace 
each temporary label by the label of its equivalence class. In between the two 
passes, the recorded set of equ ivalence, stored as a binary relation, is processed 
to determine the equivatence classes of the relation. Since that time, the union- 
find algorithm, which dynamically constructs the equivalence classes as the 


equivalences are found, has been widely used in computer science applications. 


The union-find data structure allows efficient construction and manipulation 
of equivalence classes represented by tree structures. The addition of this data 


structure is a useful improvement to the classical algorithm. 


0.30, Discuss about the hierarchical image representation. 


Ans, In segmentation an image can be represented at nus ie 
resolutions. By reducing an image's resolution, i.e. reducing the i а 
array, some data is lost, making it more difficult to recover : eie 
However, reduction in resolution results in reduced memory an n A 
requirements. Hierarchical representation of images allows i aen ж a 
multiple resolutions. In many applications, one can compute prop 
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3 n images first at a low resolution and then perform additional com 
i | | a selected area of the image at a higher resolution. Hierarchical ы) Чабор, 5 
are also used for browsing in images. Two commonly used forms ер Tesentay \ 
4 il representations are pyramids and quad trees. : Of hie, 
61 | | (i) Pyramids — A pyramid representation ofan n x n im 
the image and k reduced versions of the image. Usually n is a po ge Cont, 
x the other images are n/2 X n/2, n/4 x n/4,....0 x 1. Ina иие кы of? à 
of an image, the pixel at level / is obtained by combining info Presentat, 
а” | several pixels in the image at level / + 1. The whole image is re nag fg, 
single pixel at the top level, level 0, and the bottom level ы аң 
(unreduced) image. A pixel at a level represents aggregate the origin 
; represented by several pixels at the next level. An image and its red Informati, 
3 ina pyramid is shown in fig. 224. Here the pyramid is Shining’ b Versim, 
— the gray values in 2 x 2 neighbourhoods. It is possible ho У Simpl 
ise other strategies to form reduced-resolution versions. Si eae 
if ao taper the pyramid in nonlinear ways. - Similarly iti 
. Jl nimi i inti Я 
| dum aded) oral point is that the entire pyramid fits into a linear amy 


Level 2 Levell Level 0 


п 


1х1 


Е ҮТ 


2х2 


4х4 


8х8 


Fig. 2.24 <0 Ga 

.. (i) Quad Trees — A quad tree may be considered i 
pyramids for binary images. A quad tree contains three type, с 
+- black and gray. А quad tree is obtained bys yh 


* 


od 


tension of 
3 — white, 


"S Unit - Il 85 to В 
Aregion in an image 15 split into four subregions of identical size, as shown i T HM 
fig. 2.25. For each subregion, if all points in the region are either ae : : 
plack, then this region 1s no longer considered as a candidate for splitting; " it 
contains pixels of both kinds, it is considered to be a “gray region” i 5 
further split into four subregions. An image obtained using this recursive ride 3 
is represented in a tree structure. The splitting process is repeated until ee v 


(a) Original Image, "Gray Region" 


1 2 3 4 
(b) Original Split into Four Subregions 
(The Left Node in the Tree Corresponds 
Y 


to the Left Region in the Image) 


(c) Split the Gray Regions from Fig. 2.25 (b) into 


Four Subregions. One of these Regions is Still 
a Gray Region 


itti Region 
d) Splitting of the Last Gray 

ч or the Final Quad Tree 
Quad Tree 


iH ` Fig. 2.25 The Building of 4 
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are no gray regions in the tree. Each node in this structure is either a | А А 2. | iy 
or has four children — thus the name quad tree. sar Nog. Qoa Epes the method of region splitting and merging for region ri 
segmenta on. ee 


Quad trees are finding increasing application in spatial databases. Sey ‘Ans. Assume that, R is the whole image region and choose a predicate Q 
algorithms have been developed for converting a raster array to a quaq a method for segmenting R is to subdivide it successively into smaller and 
and a quad tree to a raster array. Algorithms for computing several pi rh E cr quadrant regions so that, for any region R; QR) = TRUE. We begin 
properties have also been developed. Ty s ith the whole region. If Q(R) = FALSE, the image is divided into quadrants. If . 

dE FALSE for any quadrant, the quadrant is subdivided into subquadrants, and i 
` 


versions are successively obtained by averaging four points. <o on. This specific splitting method has a convenient representation in the form 
esent trees in which each node includes exactly 


31. Write short note on spatial clustering. of quadtrees. Quadtrees repr 
0 р ring. four descendants as shown in fig, 2.19. Note that the root of the tree corresponds 
and that each node corresponds to the subdivision of a node 3 


Ans. It is possible to determine the image segments by simultaneous, to the whole image, 
combining clustering in measurement space with spatial region growing. Sui into four descendant nodes. In this situation, only R4 was subdivided further. If 


a technique is called spatial clustering. In essence, spatial clustering schema loyed, the final partition includes adjacent regions with identical 


. : only splitting is emp! 
combine the histogram mode seeking technique with a region growin properties. By permitting merging as well as splitting, this disadvantage can be 
de: remedied. Satisfying the constraints of segmentation requires merging only 
the predicate Q. That is, two 


spatial linkage technique. 
According to Haralick and Kell : adjacent regions whose combined pixels satisfy 
locating, in turn, all the peaks in the а eem en d € done by fin, adjacent regions Rj and Ry are merged only if Q (Rj ы Ry) = TRUE. 
расе histogram, and tha · ү, іѕ customary to satisfy a minimum quadregion size beyond which no 
ariations of the preceding 


determining all pixel focations havin; 
Nm А : g a measurement on th 
beginning with a pixel corresponding to the highest peak not me са, Ney further splitting is carried out. Numerous v 
both spatial and measurement space region growing ari x process fundamental theme are possible. For example, a significant simplification results . 3 
performed in the following manner. Initially, each se А t s dod taneous) уе permit merging of any two adjacent regions Rand Rj if each one satisfies . d 
value is on the current peak. Consider for possible 1 E ан вн рх] whot the predicate individually. This results in а much simpler way due to testing of i 
a neighbour of this pixel (in general, the nei nclusion into this segmen — (he predicate is limited to individual quadregions. This simplification is still { 
growing from) if the value of a Шош (ап ee b ier we ar capable of yielding good segmentation results. 
is cl i х ‘or an N band ima ; . | 
pe th = space to the value of the pixel and = 0.33. Write down the splitand merge algorithm for region segmentation. \ Р 
аг EM 

growing from. ger than the probability of the value of the pixel we ax Ans. Split and merge operations may be used together. After a 

presegmentation based on thresholding, à succession of splits and merges 
may be applied to refine the segmentation. Combined split and merge algorithms 
are useful for segmenting complex scenes. Domain knowledge may be 
introduced to guide the split and merge operations. 
it. Initial li ; i Suppose that an image is partitioned into a set of regions, 
s sally pima ain Jayeta ark сны ей Wy Tinkan Ee el” ги i in a region will be homogeneous ace 
spatially corresponding 4 x 4 : geach parent pixel to th 2, ...., m. All of the pixels in a reg g ‹ 
parent pixels has 8 child ixel block of child pixels, Each pair ий f property defined py a predicate A applied to the region. The predi 
block of parent pixels Tha к oe кынамы Each child pixel ig linked ас we the similarity between the pixels ina region. For example, the p 

p . The iterations proceed by assigning to wa 2х be defined using the variance in gray values within a region — 
each parent pixel 
| if the varianceis small 


the average of its child pixels. Then each child pixel compares its 
8 its value with 
A(R) = 


Another spatial clusterin i 

g scheme is a spatial i i 

жае ы of clustering. The bottom layer of the Di courts 

| аре. Еас successively higher layer of the pyramid is an j Rs өй 

the number of pixels per row and half the number of E En. ping 
o 


{Ra} forn=1, 
ording to some 
cate represents 
redicate could 


each of its parent’s values and links itself to its closest parent. Ra, 

new value is the average of the children to which it is linked i 0 otherwise 

converge reasonably quickly and for the same reason the ISO v vie iterations i i " 
DATA iterations Algorithm for Split and Merge Region Segmentation – 


converge. If the top layer of the pyramid is a2 х 2 block of great. - S Me 
then there are at most 4 segments which are the respective "Brand 5 Step 1 – Start with the entire image as a single region. . 
Sreat-grandchildren \ Step 2 — Pick a region К. If A(R) is false, then split the region into four 


of these 4 great-grandparents. : 
Ё VM gions. 


алеге a srt yt 
j^ 
. 
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Step 3 — Consider any two or Шла 


| more neighb 
77» Ry in the image, If А 


uring зигер; 


(Ri UR2 U..UR ) i ons icati tion arises from a relative disp] 
] ] i E^ > R lication mo ё 5 Ive displacement between 
into a single region, n) is true, merge the, PR un d the scene being viewed such as in robotic applicate 
Step 4 — Repeat these steps unti] no further sp); "Bion, gation and dynamic scene analysis. The use of motion in 
0.34. Discuss about the rule b T Splits ОГ merges tak n. we used spatial techniques. One of the simplest approaches for 
AHSA rfe based ased Segmentation A image e Place between two image frames f(a, b, с) and f(a, b cj) taken at 
and Levine in | 984 Fee ks System for Segmentation fas ementa, ely, is to compare the two images pixel by pixel. One 
Processes such as the line orang to this approach, System Oduceg by vy this is to form a difference Image. Assume that we have 
analyzer, the focus of analyzer, the initializer, the regio, COntains а к image containing only stationary components. Comparing this : 
le, the long term ма attention, the scheduler, and fetes analyze, the s м a subsequent image of the same scene, but including a moving Я 
short term memories hola mes (LTM) and Short term т associate memor rea in the difference of the two images canceling the stationary Bae! 
output. Other hand, Ina the input Image, the Ner tes (STM) Th Jeaving only nonzero entries that correspond to the non stationary 
about low-leve] Sementa the mode] Tepresenting ai data ang the mponents. : 
rules in the LTM against th : = control strategies, A pera Stem Knowledge ference image between two images taken at times c; and cj may be ? 
доа dus аааз usually inet І Ве STM. When a maip ichs | neds 1, if|fa,b,ci)-f(a,b,c;)|>T 
ў У j c P е) 2 Ye] Е 
rules — Pis oe чата, the mode] pe in modification is Performed руа, b) = аат r J (i) 
Я Be ru es, the Contro] e TM has thre ed, 4 Е | 
regions. e Knowledge Rules Encode the highest rules, © levels of i "is a specified threshold. Djj(a, b) has a value of 1 at spatial ў 
actions 5s nd and area in the form of situates about the Properties of ates (a, b) only if the intensity difference between two images is 
extending a Jj suiting а region, Тегріп ра Pairs. The Spec is ably different at those coordinates, as determined by the specified t 
These rules = 2 as EIE two lines, and [NE arr adding, deleting, o. б TI is assumed that all images are of the same size finally we note \ 
@ су digi by their action, fying focus of attention aiea ues of the coordinates (a, b) in equation (i) span the dimensions E M 
focus-of attention 0. Rules — These rules are бу ыл: images, so that the difference image Dij(a, b) is of the same size as the 
data entry to be us spud rules, Focus-of. vided Into two Categories — the sequence. ; f 
control the focus-of. i a région, a line Nerii rules find the next { | 
; n Strate, ef ire area, Th 222 
ња 008 don't modify the у The infe E36 Tiles i 56 URES, EDGE 
m in the 269 2 DATA STRUCTURES, 
they alter the matching order обе Sita in the short paca карина, (ING оаа амі LINE FITTING, CURVE 
їй) Hi, erent mory. Instead Я 
а rie Level These ra edge lese, 7 19080 FITTING (LEAST-SQUARE FITTING) 
j 5 аге КЕРЕ - 
а рту execute strate 
given set of data, 5 the most appropriate ks Tules that select ional area extraction ? Р 
The Conditions of the rules base mol strategy for 0.36. How do threshold values affect conventiona | 
Icting a logical o, 2 are made J ao hod for extracting area is i 
peration UP of firs { 4 i ing, the conventional metho 1 E 
ting the data entry on 4. be Performed on the is Symbolic qualifier dns, In image processing binary image by setting a threshold value. 


2 image to : j 
КЕ Пе gray scale Mens threshold value separated from those lower 


1 values greater th i ite blob corresponding to 
г" ‚ an image showing а white blo c 
1 ger uus e ЫШ канак However, the selection of the 
'shape of the animal on a blac area of the pig area. If the threshold 


PCcific strategies , ause jp "ceritical in determining the 
"ra BICS to be Incorporated Without M 18 general but 'eshold value is dap rea atthe pig will be missing, so the calculated 
escribe the motion based segmentation, ging the code, alue is too high, some m e in 16 Pal arca If the threshold value is too mn 
2 ^ t AM he emaller than К rie à | 
ао ion isa Very powerful сие used by h a ig area will be d environment could be counted as part of the pig, 
lo extract objects or regions of inter У humans ang many ome brighter parts о ^7 © rea to be larcer than the actual area. Fig. 2. 
cst froma back Y Other sing the calculated pig а ыз Esch e to different threshold values. 


sofa 


ws the binary image: 


= 


DI etree Дырын aa „„„ 
х= оз TRA 


ith CamScanner 


Scanned w 


the tree. For example, the union-fj 
(a) Gray Scale Image | 


» 4 and 8, and so on, No 
since 0 represents the background labe] 
anode has no parent. 


е parent label of a set, Here, X 
the set. PARENT is the array 
taining the union-find data Structure, 
procedure find(X, PARENT); 


1 (а) (b) 
Fig. 2.28 


ion procedure is given for two labels X and Y E the ан xs 
mage as i i t containi 
t i by dog modifies the structure (if necessary} to merge the se 
ттен onore te binary ye M Set containing Y. It starts at labels X and Y and follows the parent 
Image a function of the threshold value '. 1 
The arrow In th Hip 7 


P the tree until it reaches the roots of the two sets. If the = т 
me, one label is made the parent of the other. The due ses 
*n here arbitrarily makes X the parent of Y. It is also bc ben 
le set Sizes and to attach the smaller set to the root of the larg d 


Corresponding threshold value ; doa! «Thren t 
detection method, $ ad dge Mar 


(96 08 мш 
Value t — p 


wth -Fi Í E a 
2.37, What do you m, 2.27 Пг; 


t the union of two sets. Here, X is the label of сеш ple 
bel of the Second set and PARENT is the array containing 


‘UCture 
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Procedure union(X, Y, PARENT); 
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while PARENT[j] <>0 

j := PARENT, 17; 
while PARENT[k] <>0 
k;- 


x 
d 
^ 


LL X-ray Image ofa Result of Point 
— — Turbine Blade Detection 


i Result of 
д Thresholding 


Fig. 2.31 Representation of Point Detection 


and edge detection method ? 


Ans. There 


Line detection is more complex then point 
S represented in fig. 2.32. When the first mask 
his mask would Tespond very strongly to lines 
€ line passed through the middle row of the 
€ would be obtained in case of a constant 
mple array of 1° with a line of different gray 
nning horizontally through the array, this can be verified. The second 
sponds best to lines oriented at +45° and third mask to vertical lines 
fourth mask to lines in the — 45° direction. These directions may be 
ished also by noting that the preferred direction of each mask is weighted 
a larger coefficient as compared to other possible directions. Let Ry, Ro, 
R5, R4 represent the masks’ Tesponses in fig. 2.32, from left to right, and the 


if 


Oriented (Б) Lines Oriented (с) Lines Oriented (d) Lines Oriented 
— Horizontally at *- 45" Vertically at — 45 


Fig. 2.32 Representation of Line Detection Mask 
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four masks are run individua 


Unit - |. gs 
rough ; _ (mmy Sobel Mask — It was developed by Irwin Sob L lt 
certain point in the image the point is said to be more likely o central differences but sive el. lt uses the 
line in the direction of mask i, Alternatively, for detecting 1 pto 
direction, the mask attached with that direction i 


S greater weight to 


the central 
! ing averaging. The Sobel mask са 
ion is used and į à E ng 


n be considere 


x : das3x3 
ns to first derivatives of Gaussian kernels, 
trongest responses for one Pixel thick lin ions els arrangement shown in equation (i), Sobel edge detector 
points correspond nearest to the direction Specified by the mask e. pix 
(tii) Edge Detection – The 


vatives are computed as given below — 
pe Gx - (a5 + 2a ta)- (ag + 2a; * ag) 
Gy = (а; + 2а; + a4) - (ap + 2a, + a) 
ye] edge detector mask is as follows — 


(vi) 


í d ing, it is on i à (vii) 
‚ Points where brightness cha 


-] -2 -1 

: "n =| 0 0 0 

ighbourhood with Strong signs Sx 1 2 1] 

edge detectors work based on calculating the intensity gradi 

image. Edge detection is used to Tecognize areas of an im: -1 0 1 (ix) 
Ра. К б,=|-2 0 2 ea 

change in Intensity encounters. These changes are mostly related w Y 101 

physical boundary in the Scene from which the image is der E 

characterise object boundaries typically in images, Edges a 

identification, i i 


k is better as compared to 
ion feature of a Sobel mas 
ise suppression 
її mask. 


T" һ. 
Explain the difference between edge and line with grap! 
7, Expla 


. R.GP.V., May 2019) 
— It uses the basic con ; 


difference. Assume the pixels arrangement with respect to 


cept of centra] 
[i, j] as follows — 


the central pixel Refer to .Q.38. 


lines and poir с а о ormatio! about the various regions 
t апу ot finf lon 
, S 


h y 
J 
B These features are usually termed as local features, since the 
rty gi h 
f p one. g 
ted from the local pro € al hough the edges ar d lines are 
4 


level, yet there is an шры 
demarcates between tw 


both detected from the abrupt change in the gray 
The Prewitt edge detector partial deri 


ssentially tween 
eur. An edge essentia "dee is the border һе! 
( : усте the т, eh means that an sed be piri inside a 
Gx = (ау + Саз + a4) — (80 + Сау +ас) 1, У different AN e. on the other hand, may may run between 
= Рн i t regions. , 
and Gy = (as + Cas + a4) — (ay + Са] + a) Aili) feren 


For example, a thin lin oint is embedded 


uni eneous region. vegetation. A p 
Blot E dud land, bearing the same vee 
0] ots or agri 


Gx and G, represents the approximations at li, j] 
Putting C 1, the Prewitt edge detector mask 


becomes. 
-l -1 -I]. к д 
„ШЫ К 
по E 
бү= E 0 0 v 0) 


і differentiates in 
itt edge detector mask 
| ДЫ; ele disechon. Hence, edge detector is less 
The few edge detector masks have longer support, 


one direction while 
Vulnerable to noise. 


ic Image 
| ar Um Synthetic 
piale d one Row 29 
Р dge Pro, 
Fig. 2.34 Е 


"E 


^ —.— 
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УГРЫ" 3 


А (а) Step Edge (b) Ramp Edge 
ee iS (c) Line (d) Step Line 


Fig. 2.35 Different Types of Edges 


inside a uniformly homogeneous region and its gray value is differe 
average gray value of the region in which it is embedded. This is a 
à spike. The changes in the gray levels in cas 
ramp edge are shown in the form of an edge р 
forms and nature of ideal edges and lines, suc 
step line, are shown in fig. 2.35. 


nt from the 


nalogous, 
e of a perfect Step edge, in 


rofile in fig. 2.34, The divers, 
h as step edge, ramp edge, ling 


0.40. Explain local analysis method of edge linking and boundary detection 


Ans, Ideally, pixels lie only on edges but in actual practice, set of pixels 
seldom characterizes an edge completely due to noise, breaks in the edge from 
nonuniform illumination and other effects which result in spurious intensity 
discontinuities. Hence, to assemble edge pixels into meaningful edges, edge 
detection algorithms are followed by edge linking methods 
Local Processing 


of pixels іп a small neighbourhood about every poi 


(ii) Gradient vector direction. 
The value of Vx provides first property as 
Vx = [01+ [Gy] i) 
An edge pixel with coordinates (хо, Yo) 


in a prédefined пеј bourhood of 
(x, y), is similar in magnitude to the pixel at (x, y), if сог 


|Vx(x, y) 58 Vx(xo, уо)Ї SE: 
where E represents non-negative threshold. 


Following expression represents the direction.of the 


given below — 
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pixel at (Xo, Yo) in the predefined neighbourhood of (x, y) has an 
‘to the pixel at (х, y), if 

lox. у) Е о(хо, Уо) <А (у) 
yn-negative angle threshold. 


"uation (111), we can see the direction of the ed 
tor direction at (x, y). If direction and magnitude criteria are 
Xinthepredefined |.— — WIDE 


-— Original Image 
d of coordinates Т 8 [ RU \ 
i: bad Components of Sobel 
Vertical Operators 


ge 15 perpendicular to 


М = Components of Sobel 
this [шк at CARO Horizontal Operators 
intain a { 
ints as the center of — Linking Points 


ood is shifted from ышы ү i] 
other pixel assign БИХ Emcausmm n 
y level to each set Fig. 2.36 Representation of Edge Linking 
pixels. 


Di ing via Hough transforms and graph 
Я Ea НАНЕ и (К.СР.У, June 2017) 
niques. í 
obs i i h Transform — This method was 
] Processing via Houg \ 
чаан in 1962. At a point (ху, у;) the general equation of a 
n slope-intercept form is given below — 3i 
1 s Me ia poi i lues of m 
infinite lines may pass via point (x; yj). For de hens 
|satisfy the equation (i). Equation (i) can also be w 
c= у — MXi 
da i i ine for a fixed pair 
| 1 ion of a single line for a 1t 
-plane provides equation ped ке 
ps E E point (x;, yj) also has a line in TT с 
UA n. oint (а' 8) this line intersects the : I 
i E [үи the slope and b' represents Inte P 
ere а 


il) 


ll 


c=mxyt¥i oc 


y cz-mx*yj 


..---|b' 


Q f " 
а у) 
LJ 


m 


ace Representation 


a 


eter Sp 
ne Representation (b) Param 


Fig. 237 
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line containing both (х yj) and (х;, yp points in the xy-plane, | 
contained om this line have lines in parameter space Which ij n fact, ар 
These concepts are shown in fig. 2.37: `` YE htérsect at 


Hough transform is more popular because 
is allows subdividing the parameter space into 
accumulator cells shown in fig. 2.38, where 
Cmax and Стіп represent expected range of 
intercept values and тау and mg, represent Я 
expected range of slope. The cell at coordinates ; rica line m 

d jn fig. 


urves in place of straight lines in the p9-plane. As before Q 
a line x cos Ө; + y sin Ө; = p; yield Q sinusoidal curves 
parameter space. For increasing values of Ө 


idal © 


; оп 
Poi E ө) in the 


( 0 


> being equal to the positive x-intercept. In a similar way, a 
р 9 — 905, with p being equal to the positive y-intercept or 
= 90°, with p being equal to the negative y-intercept 
g to equation (iii). : : 
oretic Methods — It is a global approach P 


ing via Graph-The 


(i, yeah accumulator value A(i, j) corresponds 
to the square which is related with З 2j 
М parameter c : 
space coordinates (mi, cj). At the start; these m. and linking based on showing edge segments in the form ofa 
the graph for low cost paths corresponding to significant 
d method. Which works well in presence 


sentation gives a Tuggel 


a =i i zéro value. Then; for every point 2 
хү, Ук), let the paramı | i a s This 
безш. иле шшс ed ta Fah of Fig. pa Representation П h ET. the method is more difficult and needs more processing . 
and corresponding c value can be obtai di ч ож : em = bai S QNO) are ones 
n m о е ined from equation c = -mx, +y, non en A ovg as graph G= N, U). Arc refers to a each pair (n; | : 
off to the closest permitted value in the Ж ; ; h refer as a graph G in which the arcs are directed. The vts E 
' cted graph rete if an arc is directed from node 


as a successor of the parent node ni 


ipansi identifying the s 
ansion refers to the process of iden 
ae 1 zero has a single 


d in each graph. Leve 
fie al level are known as g0 


successors of a Ee 
node which is 1$ 
al nodes. With B 


im ie d of mp outputs in solution cg, we let A(p, q) = A(p, q) + 
‚а value of Q in A(i, j) corresponds to Q points rdc Wing | 
on 


the line y= тх +c; ae: 
by the aber of M e A colinearity ofthese points is determine 
Я | ions in the mc-plane. For-&v i evel is speci 
value of'c co : pedis ‚ For every point (хд, бру. і 
subdividing ое be ES k possible values of m can be о kn the root e eos ieee : А 1 
ООДУ crements. This proc bi Д a cost c(n;, nj М Ы | 
with n ima : CIT This process requires nk computat p J bei i 
does not sei sa чес аена is linear in n and the ciitiplicafionat $n, tonk wih papa зе e D | 
n. The problem with using th computations uriless k approaches or exceeds ? 20 nts the cost == Ve 
the slope approaches infinity as he Tin у= mx’ is to show a line for which k Following expression терге e э, ° 
problejn the normal representati ine approaches the vertical. To solve this | "ne +o, 240 Representation — 
| E on of line is given as below — k ` | | de f Ed, Element ‘ E 
NES MORE . . 73 \ Ў NO of Edge ё 
' “In constructing accu : s үн узшӨ=р „ун шу .. (iii) c= Y etnia n) © between Pixels p and 
Е onstructing accumulator table, the use of thi tati T “the bo two pixels p an 
to the method disci: r table, the use of this representation is identical Binh TU dary between two P 
jT UNO kee аена, ш ШОШ slop e-intercept representation: idc, t et us define an edge element as E de xy coordinates of points p and 
ач бн) e og, unas that p and q are four ppp 2.40), the pairs (xy Ур) балу 
- fy edge elements. Ін онна of connected edge elements 8 уз 


Ап ašsociated cost of each edge clement 

Be | ЫЕ 
Кер, 9-H- (xp) - sige image, and x(p) and x(q) $ 

is highest gray level value in the images 


level values 9 
represente ich are outs! 


the numbe 


Tm ^ 
bdivideg into’ Cells 


(a) ‘Normal Line Representation 
` | ` Ей. 


ith CamScanner 
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t of equation, y = mx + с, the objective is to determine 
о 


ora line fi ts, (ху, Ур. The least square fit is made by minimizing 
, p2/v' 


e data рош 


.)* over all data points, i= 1, ...., n. The solution : 
à in manner to decre ors, E(y Уі imultaneous equations, Ly; = mYx; +cn, an 

: of finding. Let, гп) Tepresents an estimate of the Cost of a minimų olving the si data points, i, to obtain m and c. 

: Е from the start node o to a 80al node, where the path is co M cost Da for all data p 2 


хі +CĒXi» blem in blindly applying the least-squares method just 
‘be a pro 


i ications. That is, the method is not independent 
p. i m. the line approaches vertical, this method of 
d on, and 25 де Ж. inappropriate. Qne approach is to minimize y-error 
| ' = n. 5 j = соь +45 degrees around the horizontal and to d 
| ES. ed ia be+45 degrees around the vertical. A more em 
Ec Р not with respect to x or y, but with respect T e 
; ode line in any orientation. This can s n a a line- 
Bed princi i ination or eigenvector line fitting. 
called principal axis determina 


. (iv) Indicate closed the Open node n IF esti 
П IS smallest, 


i I$ not indicated, set 


(ni) = g(n) + сп, п) ; 
and indicate jt Open, and di 


E. о 
ais д У poo 
(viii) IF a Successor n; 18 indicated closed or open, update its value by- ко (b) Е 
dics fo = mun[g(n;), g(n) + с(п, nj]... (a) Ww 
20215 open those closed successors whose g' values = Fig. 2. ints (1,1), (2, 2» 
and redirect to n the Pointers from all nodes Wins g sis ae анаты r i иайда dw e е рУ | 
and GoTo step (iii). EET Jowere Example — Using Houg d the equation of line. 
(ix) End sho’ d (3, 3) are collinear and ке ine 
This algorithm does not i аы о" that, the equatio 
cout path, d Provide guarantee that a path find by itis minimum- dS mx-c: Hough | | 
Р : ; | 73 erform ^ 
0.42. Explain the term line fitting, Mo rg г ащ bs Feed te convert line from ыа 
Ans. A simple way to fit a straight line to a curve s. E | lane: 
| urve ig ify the e to (m, c) p i 

endpoints ofthe curve as the straight line endpoints, On-valued piae], 00У Ше E is (m, c) plane 18 

in fig. 2.41 (a) and two corner fits are shown in fig, 2.41(b) and (о) Ee Wh : T 

(b), the fit is made exactly on data Points. Because of this, the endpoj 18. 2. : 3 tep x +c 

to connect, but the corner angle is not as sharp as intended in the orj M T» r a 1) 
2.41 (c), a least squares fit was performed to two sets of poi Kial, In E. D. E c 
intersection of them was taken as the corner locati i е 


) +1 *Dpm-l 
: : ; a T c=-m =-mt 
corner. However, this may result in some portions à sharper 


of the curve hav; Ка = 0 then Us о+1)с=1 " 
error with respect to the fit. A popular way to achieve the lowest аур | E If m = 0 then (С м Fig. 2 
for all points оп the curve is to perform a least-squares fit of a line tor point (m, c) - (5 P 


" 
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Similarly for 
If (x,y) = (2,2) then (m, c) = (1,2) 
If (x,y) = (3,3) then (m, c) = (1,3) 
Step 2 - | 
Plot a graph К 
T pr for (m, с) = (1, 1), (1, 2) 
St - igi 
bz Me 3 – The original equation of line i 
с) put the value of m and con this 


ч equation, 
Ф. Then y=x 
EC Points (1,1), (2,2), and (3,3) are collinear - . А 1e 
. : Ы ig. 2. Ран z 2 
Eu | it Describe the curve fitting of i IS. 2.43 CRE ANALYSIS EGION PROPERTIES, EXT REMAL. 
А ; ; TuS Curve-fitting refers to th eas PATIAL MOMENTS, MIXED SPATIAL | 
I" неет aaa nn og mma а — 
3 y ip l 2-dimensi Broup. This d €SSIOn ү; 
E j Such as the diei nsional QD) d 2 5 data has о 0 
d : n ; gital image that i ata is represented ne OF two A 
У | the image or block (small ve 6 че and processed nee of data 01. Write down some basic properties of region. 
ty means that An: "Thé properties of the regions become the input to higher level 


decision-making tasks such as recognition or 


es that perform 
s that can produce 


on. Most image processing packages have operator: 

roperties for each region. 

е basic properties of regions are as follows — | 
i) Area of Region — We denote the set of pixels in a region by Rg. 

plest geometric properties are the region’s area denoted by A and the 


min Уау) a(x, yy 
h i á | 
Where g(x, y) is the original inte 


d'denoted by aCe): The area of region defined as ~ ' : 


E Aur, uini, Aeon 
a : (rece) eRE : 
f the pixels in the region Rg- 


min 
a,b,c 


eans the area is just a count о 
(ii Ceniroid of Region - The.centr 
* location of the pixels in the set Rg. 


oid of region Gesce) is thus the 


The centroid is defined as — -^ 


« | OPERARE t | Fs 
Sie Mes carcer eh 
can be generali i parameters fi d Оет: 
ized for higher-order polynomials rer: \ айла 2 | 
н Ата 
: i (&c.)eRE 
ova уну aceto et A РАЧА : | urge 
! Noté: that. even though (re. Ce) € Rg іѕа pair of integers бесе) is 
n of tenths of apixelis justifiable 


generally nota pair of integers, often  precisio 
ў ‘for thé’ centroid. 
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p Second Circularity Measure of Region — Haralick proposed 
Hey measure in 1974 to overcome the first circularity measure " 


(iii) Perimeter of Region — A simple definition of 
the p 


Lm эы holes is the set of its interior border pixe] € Perime 
ki ; e ; by РЕ, and a pixel of a region is a borde : Perimeter ор ad circularity is defined as — 
eigh uring pixel that is outside the region. For e T pixel if it h те, 
perimeter pixels will be 4-connected, wh xample, the res Ч Son, EU Co = PRE Я 
whether a pixel inside th ected, when 8-connectivity is yg Ung sey 2 oR 
ЖУ, tie теа е же IS connected to a pixel Ө} ed to deter Of E 
, ing set i ; si 5 
d-connectivity is use H Я dt pixels well be и the Tegin Re’ ORE are the mean and standard deviation of the distance 
Я ine whe А ЖЕ Cle ; : D 
connected to a pixel outside the region ther a pixel inside the ML jd of the shape to the shape boundary. Circularity measure can 
j ШЇ ted by mean radial distance and standard deviation of radial distance 


are defined below — 


The 4-connected perimeter Pe, 
he mean radial distance is defined as — 


defined i and the 8-connected perimeter P 
Eg ate 


| adial Distance — T! 
ing = 2 Y о9о) бе) 
" 


1 Distance — The standard deviation of 


=? 
D 


4 {lfe Ce) € RelNg (re, Ce) py Rg + ф) 


Неге N is ће number of contigéitong (re с) 
гй Deviation of Radia 


ce is defined as — 


D 


(iv) The Perimet: 
| er Length 
perimeter Pr, E 


. egion — To n E" 

the pixels i f ompute length Ip, 

in Pg must be ordered in a hu o. эй : [Pel vt 
haw EX Teg , Ceg), 


M (eyo Cem- ) >, each pair of su i 
pixels in the sequence being 


neighbours, includin 
Then the perimeter length IPglis 


Ccessive 
g the first and last pixels, 


M — 1 lie on the perimeter | 


MS 


‘the set of pixels (Те Sem)» 


ularity measure C2 incr 


defined as — 
\ |: 
r and is similar for digital and \ | 


eases monotonically 


gion. The second circ 
ital shape becomes more circula 


us shapes. 


" 


Explain the term extremal points in 
Extremal points is the main properties of regio 
angle with horizontal and vertical sides that encloses er 
y its topmost, bottommost, leftmost, and rightmost points. In ig. а 
be as many as eight distinct extremal pixels to a region — topm 


ic) divided by the R), rightmost top (RT), rightmost bottom (RB), bottommost right 


| LB), leftmost top (LT) and 
bottommost left (BL), leftmost bottom (LB) Е 


region properties. 
n. A bounding box is 
ses the region and ar 


length ofthe perimeter 


[рь c : 
p i3 | left h ] point has а extremal coor 
| е ‚ Each extremal Dou 

rits B lumn coordinate position. Each extremal point lies on the 

ccur in opposite pairs such as 


However, for d 
" А ints О! е 
g box of the region. Extremal Pov) 17. Each pair of opposite 


š Я В В 
th BR, TR with BL, RT with LY pes es of the axis include its axis 


border pixels emal points defines an axis. Useful properti ial 
Е | or as the len points: Cent 1 points come from a spatia 
for vertical or horizontal moves and Р e. gth and orientation. Because the extrem : ваце formula will provide 

V2 for dia Counting | | digitization or quantization, the standard Euclidean є! w^ 


gonal moves $i 
ig tances that are biased slightly low. 
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Unit - Ill 107 
TL TR r row moment is defined as — 


! orde ре An 
1 mesi Ma “4 
> ‘W(te,Ce) = А DE -n) 


JA (re;ce)eRg 
| rder column moment is defined as — 
E E sl = 
SA Ele- ce)? 


| order row column (mixed) moment is defined as — 
EDU. “| T кы. шы 
Н,е) = A Le —Te)(Ce — Ce) 
d (re, ce)eRg 
uantities are often used as simple shape descriptors, as they are 
anslation and scale change of a 2D shape. The 2" order spatial 


RB 


BL BR 


Fig. 3.1 The Ej 
ight Extremal Рој, : 
For example, let consi ints of Region 


ance matrix has value and meaning for any two-dimensional 
istribution. 


i. Discuss about the mixed spatial gray level moments region 


Tus 1 lejas]. l 
X(9)- 100 | ay level properties consist the mixed spatial gray level moments. There 


o 2nd order mixed gray level spatial moments which are defined as — 
1 


: the extremal axis Mes =~ Me ~ Fe IM -ce)-H] 
: to е Point (те,,с e) is defined ü from extrema] Point (те, Са) А (rs, c, )eRg 
- xtremal axis length ~ ^ 1 = 
| Е — Qi = — Ce — с.) ^ c4) ^ u] 
(ii) Ue.) Be D e) [Ire — Ce 


These spatial moments can be used to determine the least squares, best 
level intensity planes to the observed gray level spatial pattern of the 


Rg. The least-squares fit to the observed l(r.,c,) is the gray level 


ity plane X(r, -г.)+ Y(Ce -e)* Z determined from the X, Y and Z 


‘Mat minimizes, defined by — 


= = 2 
2. ухо нуе, -60*Z- 16.2] 
(re;ce)eRg 


waere H(t.) Measures TOW variat; He)? H(r. c. ) and 


m ; 

Taking partial derivatives of €” with respect to X, Y and Z and setting 

lese E ial derivatives to zero leads to the normal regression equation that in 
а 


Г 


t» 
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this instance is defined as — 
Unit - 11 109 


n ES Le Ce) 


sf Р У (т. -т,„)? Leake 
свв _ ERN 9) А 
Xe EX i (e.c )eRg е Г X, mM (n, ERE ^ 
e-hXe-6) Уру? 5 
(esc )eRg е (Ce —c,) e)ERp e ^ 
Ye -&) (rec, )eRg d A Ee ~) H(resce) = n 205 = Te (Ce -e) 
(k»ce)eRg Le 76e) еей 4 (re, ce) eRE 
(re c, )eRg Ni 
d | (scs )eR, w that the unknown parameters X and Y must satisfy — 
(e-r) : у 
"2S T. )l(r. c.) | e) ШЩт.,се) X X. Hg) 
Ус, —c,)l(r, c ) ) Bose) ILY Ю(е,,в) 
j (тес, esve j à 
Yu c) ing to Kramer's rule, solving for X and Y, we get 
es 
Si (re ce )eRg Я 
> 2G =r) =0 and : r 8) тсе) Шыл) Рб) 
ў =0 апі. еы Е 
5 (e ce )eRE е п Èe —c.)=0 E сев) (сезсе) H(tece) Heeg) 
(ie, cg) e- <> put ће y= 
e>Ce)ERp se Values „Ш V (ries) Bn). (тусе) 


equation (1), then equation will be ~ 
(т.с, ) (бе, е 


Уе -)? 
Y6-z 


4 (e.c, )eRg Rr — 
^ 2: ге ай се; gs Tce) 0 xX 
V (Rem, x 600. саада 
1 0 (este)eRp ag t 0 2 Ү | = 
uM esM der Se RIVETS: 
EE ST (ANANSIS-S 
; APE NU 


Cesce)eRg | LZ і icm 


e from vertical projection, horizontal 
centroid of region, second 


. Signature properties obtainable 
tion and diagonal projection, include area, 
nts and bounding rectangle. These are as follows — 


п — We denote the vertical projection by Vp. It 


| СА 1 = З 
AG 20 fe 76е) =р i = (i) Vertical Projectio 
bitis | жй defined as — 


Recallin 
ig that, 2nd 

? Vp(Ce) X {Te | (re. Ce) € Rg} 
tal Projection – We denote th 


e horizontal projection by 


Order Spatia] moments Of re ons — 
gI ns 


(ii) Horizon 
Ip. It is defined as — 


1 
li cse ch сі 
: Cond = Die - 1%)? 
va Bs (fe c )e Rp of 


os 


emn 


Hp(re) = (Се | Ces ce) € Rg} M. 
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iti) Diagonal Projection — There 


| Te are two diagonal 
going from lower left to upper tight and | 


second, going from M 
lower right. We denote diagonal projection by Dp: The diagonal т leh, 
Dp goes from lower left to upper right, is defined by Projen 


Dg (D =#{(te.c,) eRy Ir. +c, 


Projeg 


is defined by 


Dp, (u) = "(e ce) € Ев |r 


(iv) Area of Region- The area A can be obtaj 
The area A is defined as — UM ce 


A= Xp =D 21 =D Hp te) 


(т.с, )eRg Te {сеть Ce)ERg} Te 


(V) Bounding Rectangle — Th 


e 
Tow, bottom ro 


w, leftmost and rightmost. 


(a) Top 


Row ~ It is denot 
Tectangle is given by "i 


Tmin- The top row of bounding 


Tmin = min{r, 1c.) ERg} 


(c) Leftmost — This can b 
ed 
column of the bounding Iectangle is given by 


[E 


enoted by 


тіп (c, [а,с.)є Rg} ы 


тіп {се | Vp (се) * 0} 
(4) Rightmost — This can be denoted by 


column of the 


bounding rectangle is given by Ries Tightmost 
E a AY e doe Cmax 7 MAK {Ce [(r, c, ) e Rg} йы 


i max (с, | Vp(c,) 0} а 


Y are divided in four Sections, tty 


s can be denoted 
unding rectangle rectangle is given by : si imar 
Imax = max {r, l(fes¢e)ERp} BEST om 
" = max(r, (Hp (r) #0} н | Ё 
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‘Region — The centroid of region include horizontal 
(vi) Mos o ion diagonal projection. These are as follows — 
vertical 


5 Row Centroid — The row centroid te сап be obtained from 
El B ection Hp, as defined below — 


Ш =: >. 
; A а„,с„)еКЕ 


= iX Èr 


Te {Cel(Te,Ce ERE} 


= iX х1 


т {се|(т,се)єКЕ} 


= ros Hp (te) 


e 


(b) Column Centroid — The column centroid c, can be 
'om the vertical projection Vp as follows — 


1 
eq, Ў 


А Gesce)eRE 


NDA: 


Ce (rele, ce) ERE} 


= iXe 2j 


со Gels ee) RE) 


Е ro (ce) 


1 
Би А 


е 
іа 
iagonal Centroi m | 
bs obtained from the diagonal projection Dp, 
e 


nal centroid / can bi 
1 
7 = —) Dy (I) 
3 T= x PL 


The diagonal centroid Į is related t 


p XX! 2! 


Т (ree) ERE енсе 


ly унд 


AUT. тсе) Кесе 


(Lower Left to Upper Right) — The 


Р o ће roW and column centroid in 


Scanned with CamScanner 


112 Image Processing and Computer Vision 


= iX 2,5 


1 esee) eR p [rs ec, =} 


em: 


l {(tesCe JERE Ite +04 =} 
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LO Yu 
= — е, 
P (eee) A eRe 


= iX EA -с,)2 


се {Tel(Te,Ce JERE} 


-iXe- x 


{Tel (Te Ce eRe} 


^A by +1 275; 


(re. c, )eRg A (re ce)ERg 


(d) Diagonal Centro 


23 2, 
Е n (Ce — ce)" Vp(ce) 
diagonal centroid U 


е 


ight) — The 
Left to Upper Right) 
1 Moment (Lower р We 
p i oi т) can be obtained from the diagonal projection 
f e # 
E ону 


ей а$ = 


1 D? T) 
-—Y(-D»pg( 
| Hu) = ro is 
Similarly, the diagonal centroid u is related to the Tow and column centroid г.с). (reste) and 
т . in following way — TET nal -inoment Шы) is related to He сг)» а 
р ies uem À econd diag 
б ^ А U-n-q UM 7 а 
A4 ; in following way 
sem" _ Oii) Second Moments — Second Moments also include horizontal 
я ps Projection, Vertica] Projection, diagonal Projection. 
ae 
T (a) Row Moment — 


prem y 
1 (te + Ce —Te 7 Ce 
=—) P 

A 


The second TOW mo 


1 {(tesCe)eR Ete tee d) 
Inent т.т) can be —- 
obtained from the horizontal projection Hp, defined as_ 1 Fite &)- (Ce - €] 
Ў == е 
: b. X { 
я | "eu go Riley? А к еке 
(re ce)eRg 


= <= by -7)? 


Te {el(tesce)eRg} 


ZNE 
— 
L yqs-RY 26. Ce 762) € (ee =C 
Te —Te 

A (e,ceERE 

1 — 
2 42. =r)? by 

Te 


“AKellterce)eRpy — ^ 


Li 


Шал.) * Dee) T Pese) 
mo 


ond 
ined from the sec 

d mixed moment can be obtaine 

the secon 


— Hence, 
1 = 
а pec ~ te)? Hp(t,) 

fe 


diagonal moment Hd.) by 


(b) Column Moment — Th 


Wu) - (тег) —E(ce,Ce) 
can be obtained from the Vertical project 


тсе) 7. 2 
€ second column Moment Hee 
ion Vp, definedas. (е 


Ce) 
Lux E 


4 1 
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Unit 


grid. In fig. 3.2 (f), the last step is 


(d) Diagonal Moment (Upper т, 
eft 
to compute the shape number. 


second diagonal mo 
ii . 
ent H(u,u) is also related to p fer 
(тес, 


its 2 difference 


ti 
0 Lower Righ 


D. 


e)» Far, 


Hau) = Ei 
АХ РУС 32 
{(tesCe JER Ele -ce =u) e Ce) 
Chain Code- 0 3 2 1 
Difference- 3 3 3 3 


Shape No- 3 3 3 3 


1 
"V 16-90-6907 


y и х (e, c, )eRE 
b 
fe 1 
A > AE -2(n-r. — 
(e ce JERE ЭСА LOIN e 
e -c,)? Chain Code- 0 0 3 22 1 
Difference- 303303 
ShapeNo- 03303 3 


= Hes) -2 
\ , Hence, the se Te) Hrs, ce) Шс, 
diagonal moment 


Ce) 


H(u,u) by 


Hcg) = ee) еы) Mayu) 
e) “Hey 


BE 0.0003003223222122 5 
а. 3.00031033013003 006 
Misano 00031033013 0031504 


Fig. 3.2 


7 ^ 4a its the numb 
Beo M sb with ee ae nt sh 
"AOR follows from ен numbers are an represent, the sha 
` Г shape nu. T the I" differen, з shown in fig. AE [st dme of order " 
rU mber u € of thi : 32 (b rences Fä 
Ax 7 sele suall 5 code. A (b). Th , and i 
| ve ku boundaries ms n because o though the dove Shape number | 
| - Shapen epressions compa the way the = А the resulting : j 
1 Spacing W: : | 
Spacing $ as | 
ometi - 
0.7. Discuss about the relational distance asa framework for matching. 
elations DA = (Rr» ^» 


aisa sequence ofr 
teger n; with Rj € А?! 


s of the entity being described, 
among the parts. Arelational 
to describe two dimensional 


1 description D 


ib where for each |, there exists a po 
for some set A. Intuitively A is a set of the part 
and the relation Ri indicate various relationship: 
description is a data structure that may be used 


_ Ans. Arelationa 


p= 1, es 


sitive in 


Subdivisi 
sion 2 
бло; of the basic 
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Оп = | 117 
th н ыа. () Rp (х, Dy =0 
н Se К i 

Will assum АЧ Boone one acs Dx and Dy, then EB. Qt (m 

y exists one-one, onto f with E(f) = 0, Thus fic .. -2 Px Dy) 
Parts to $ Us f is an isomo hi 
Buarante d Dy. "phism 
distance is a metric. © that 


Te], Dy = Rp (Dy, Dy) 


Assume, f be any one-one, onto mapping from X to Y. For a Rp D% Dy) 


ny 


a positive integer, the composition R o f of relation R with flings Rex 


› El 
On fis given 8 min Y | R; of —S; |--|S; «£7 p. 
Коё Hyjal YN) € Y" | there exists бк "wp f 2 s PER il 
j XN JER ў 
s К І 
with f(x )= " : Я ИГЕ „ре 
The function f maps parts from set X to parts from set Yale | F mip 2 Rief -S;|+|S,o(f1)—R, | 

error of f for the ith pais of corresponding relations (R; and S) in И » 

is given by ; "4 Хаму 


EU 1 
: = min? '|S; of -Ri |-|R;of^! -;| 
ЕЕ) = |R;e f - S. 41S. o £1 

s(f) = [Ryo f—S)|+|S; «£7! в, | 


cia 
B . E H Li l 
The structural error indicates how many tuples in К; are not mapped iyii min У! Sjof -Ri |+|R;o pli 5; | 
to tuples їп S; and how many tuples in Sj are not mapped b | ; : 


i=l 


DT LR, c X'i,S; c Yi and Т, CZ". Let fi CXxZ 
il Bi rie f that minimi 
relat The total error gives a quantitative idea of the difference between the two h NER ео Ox De) REIHE 
ional descriptions Dy and Dy with respect to the mapping f. 7 


Rp (Dx, Dy) between Dy and Dy is then denoetty | 


X5Y- fio f2. Then f is one-one and onto and produces some 
Rp (Dx,Dy) = min E(f) E 


ч ith respect to Dx and Dy. Assume x eR,» fj^ ғ, -8;. Then 


А and onto, 
It | : оа. fz and x eS; Since x c Ri fof, and ANUS. 
| 7 | : » Ld fs 
". шеш that, the relational distance is the minimal total error obtained i pi 3 - R; o f, such that (y) = (be f2 and {х} = 4} ° 0 
уш -опе, onto mapping f from X to Y. We call a mapping f that minimizs aunique ye Rjo tı = {y} ° f is an clement 
n а best mapping from Dx to Dy. If there is more than one b Бушр. en yc R; об - T; If y e T;, then (xj 
i 8, one can be arbitrarily selected as the designated best mapping- nee de в "i ri 
& 5 ; oie jef | To 
n. Moms Rp be the relational-distance measure, and let Dy Dy for each x eRjo fie f2 -Si either x eTi 
gis ary relationa] descriptions, then prove that hic : 
p (Dy, Dy) = 0, if and only if Dy and Dy аге їзотогр f, - Tj, we have Si | 
TOUT ) IT;o f2 7i 


ean a К ofi o f2 - Si IR; fi T+ 
= Rp (Dy, Dj) + Rp (Dz, Dy. 
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| Thus 


А E I 
EUER | РРО. 
M huoc - | 21 fio f2 SIS ARIS п-т 


MES or. i] " iof. 
т. l Similarly, we can show that Bus 9 
i I 
i Sio f;lo рі : 
ey 219 2 *fj SIS IT» f! -R + түз, E 
| i Adding, we get n ipo 062 7T 
9 1 , I 
е! УА; оо =S, PEN 
MEE Me EORR 1'~S; | 
zd ^ v Ne У iF у 
Pe tie P S? |R;»fj -T o £l I 
ї КМ У-У a ! +119 fj К IT of -s; |. T 
NER | Which says 5 faim s Ty 
roce MB E(f) wrt D 
a e ^ x and D «R (р 
Ba " р( x:Dz)+Rp(Dz, Dy) { 
: D(Dx,Dy) « E(f) 
so 
4 Rp(Dx,D 
, Тһиѕ the relational E Rp@x.Dz)+Rp(Dz,Dy) 
isomorphism, ce of two relation descripti p 


ere anA | Ox to another object ге 


Set X has the ordering < 
| E <А, ху, ons RP and the Suppose that the primitive 


ordering < y, 
À " У, -..., У >, 8 e 
d | hypothesized that primitive Xi ipe a during nus Wa Bus or 


description Oy 


PS one o H ka to prim tchin 
Mus f the following conditions holds — s Yj. The ord севе i 
А ering tells us thal 
be х ^а) Jp maps (оу, , x 
SN | У 
Ме; E 
Kurs ema 
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010. Explain various stages of view class matching. 
Ans. A view class matching can be divided into two stages, when a 3-р 
object is represented by a view-class model. These stages are as follows — 
`0) Determining View Class of the Object — A view class is 
represen sented by a hierarchical relational structure called the relational pyramid 
in which primitives appear on the lowest level and each of the other levels 
resents relationships among entities from lower levels. The full relational 
pyramid structure is for use in detailed matching for determining the exact 
pose of the object after the view class is identified. For rapid view class 
identification, relational summaries were derived from the relational pyramids. 
Ifthe relational pyramid has a relation R with c tuples given by 


ӨЧ, ti, j)-- (Na ;15,)] | j = 1,....,е} 
Here, t; j is a tuple from a lower level of the pyramid and N; is the name of 
elation that tuple t; ; comes from. 


he summary has a corresponding relation Ру with a single tuple 


n has four types of the form [ROCK, r), (BRAVO, b)], then the collinear 
ary relation has one tuple [((ROCK, BRAVO), 4], indicating that there are 
ollihearity relationships between a rock junction and bravo junction in 
drawing of this view class. 

The online system keeps an evidence accumulator for each view class, 
ized to zero. For exact matching, the system traverses the summary 
‘Structure, it adds one to the accumulators of all the view classes on the list 
‘attached to that tuple in the index. The view class or classes with maximal 
“evidence are selected. For inexact matching, when considering summary tuple 


4e, N,), c], the system adds eke to the accumulators of those 
UN}, ......, №), Ch 

t attached to [(Nj,... Na ),c + К] and ((Nj,., Nn) - К) 
for к= 0, ....., K, where K is the maximum amount of deviation allowed. 


E (i) Pose Determination with View Class — Once the view class has 
termined, feature correspondences must be found the determine the 
ble approaches. First is to use a general purpose 
lass, with.the particular relationships and 


View classes on lis 


been de 
pose. There are two possi 
matching procedure for each view c 


Ж сё} 
f 
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constraints of that vie 
| w class used 
relational pyramid structure i SPP ы 
stru i i d 
the feature matching а a к: relational si sd 
use higher level features ae level ofthe dL Ча 
can result in a fast match 5 raa e ower М 
. The second possibility is lower levels git | 
o develo О руган 
D асул. tt 


Solution, у 
can con, The 


procedure for each vi 
view class. A 
class, possib : нои preprocessi 
, possibly using training data or th sing program an ust 
eoretical analysis t alyzes ea Sie 
О determi ch y 
TMine le 


features, an Ok f 

en k ab 
, d then selects a sequence of features to lo Or in th 
Q. . ite b ief п te | | 
П. Wri rief note оп model database | 


Ans. An importan 
database of iD i t problem for robot visio; 
candidate models ко м stem allows ics die iS to organi, ji: — — = | | 
д aches to this SS to th © the д У - m TT) 
problem are available. Forex, ADING LIN E NITION z LAC ERSSTFICATION 
ре \ M f 
1 BY LABELING OF EDGES, RECOGNITION 
G LABELING PROBLEM, 


the original affine-i 
B e-invariant ч B 
all possible models in its E technique stores (m 

F average of y interest Serapis structure. If there are odel, basis) раг 

tabase is given by , then total number of suc a ia each | 
airs stored į : 
intl | CK 

et inodel in details. 


ALGORITHM 


uss about the fac 


= x 
= (y) E 
Ix-4 02 Disc 
The facet model is à powerful iool in image proc 
om edge detection, background normalization, sha 
dures involving det 


ime segmentation proce 
alleys, and ridges. The facet model principl 
e error between the image thought 


intensity surface and the observed da 
discretized sam 


i x 
Bro quite lar, j 
which can wW ge. Ifthe object models can be char: te 
ac rized by i 
their 


to image 


he image is С 
the facet model inc 


in the image 
у level surface 


it polynominal 


plane. 


lies - r є R. Let Pa 
are iteratively construct 


ed as follows 7 


e is 
of as a piecewise 
ta from the physical 


onsidered as a noisy 

i ise cons 
constant mode 
ed model, each 
el used in this WO 


essing. Its uses 
pe and surface 
ection of corners, 


based on the 


pling of the surface. 


tant, piecewise 
1, each 


region 
rk 15 


defined by equation (i); zey " 
+ Куху + KY 


Assume that 
i acl 
D, in A, define the зае been constructed. f(x, у) = 2 
istance of D, with - For any relational ipfi + kgx2y + KoXY + Kioy n 
T(D,,A)= x With respect to A is рју al description l ; y) is the gray level value at pixel location (x, y) whose neighbourhood 
n (D,,D). pa by e fitted. A local vector of the ten coefficients, computed as weighted sums 
uror Qv an values in the local neighbourhood, is found for each pixel © y.^ discrete 
= : у, i in T(Dy,A gonal polynomial basis permits independent estimation ofeach coefficient 
lia rab a ee De Щр, A) is used linear combination of the data values in the neighbourhood of (x y). Those 
can be repeated to ш. If the cluste cA use ear | ша vas E ee а = 
create a hierarchical stru rs are large, thi 4 nomials are given by equation (ii) f : js 
structure of clusters and PUMA process ined by taking the tensor product of the 2 sets of 1-D po ynomi >. E. 
Р цанус. » Let the discrete integer index set R be symmetric In the sense that г € 
(r) be the nh order polynomial. The discrete polynomials 
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D 2 
Define Po(r) = 1. Suppose Po(r), ...., Pis 100) have been defi ағ Shy 
in à e 
3 P(r) = гп ta, yr? +... +аг+аџ s and 5 2 а 
> Pa) must be orthogonal to each polynomial Po(r), „р "(i ube ot 
have the set of n linear equations TMC). We theg (уйк By 


0 k=0 

reR СО 
Solving for the 

Polynomials 


of those quantities are used to identify the region’s label. 
Ü 


n~]. 
set of equations yields the set of discrete оң 
sci А o 


short note on labeling lines. 


3 ting 
Bona 4.1, a cube Tes 
| , iM labeled with a + 
а convex edge, those 


Pii (r) = тРу(т)— Bac) ў 
where Bi- Утв з) р 


5 а – are caused by a 
Ув г?) g,and those labeled with | 
reR NN ised by matter occluding 
Po(r) = 1, Pr) =r ich behind it. The occluding 
The first five polynomials are given as ` : › the right of the line = : 
Po(r) 21 direction of the >, the Fig. 4.1 
P(r) =r А Table 4.1 Vertex Catalogue 
P(r) = r? -#2 
Ho 4 
BG = r? -| 4, 
ш 
( Ij p CA. 2]: k 
Ру (т) = г + H2l4 Hobs )r tole - ui i 
г if ." Нора -p3 hx Ы: ; 
where y, = De: | ` : 2 : 
—— seR eyes С Сга 
The facet model consists of. solving an equal weighted least square fitting 
problem by minimizing the error НЕ Meas, : 
; i "E : t 
; = D| dO- Уат 82. о 


i i reRL + . п=0; ХОРА, 3 
in terms of the a, coefficients. d(r) is the datd value obsérved (вау level values) i à 
The coefficients of the bivariate cubic of equation @; кБ k, can then be | [e 
determined. An error image describing the quality of fit is 

the ten coefficients k; defining the polynomial át pixel | 


м 


x 


‘also generated. Given, 


З :Осанон (x, y) а number 
of topographic measurements сап Бе determined. Image’ 


level, and invariant functions of directional derivativ. NEC 
the Hessian of the facets given by equation (viii), _ 
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occluded surface is to the 1 is 
С eft. I 1 \ i 
lowest lines with < instead of w : trates that a set of vertices or edges can define many different 
E nevertheless does convey three-dimensional information. 
rojection specifications, a wire-frame object may be constructed 
р {һе N projections. However, for a given object, there is 


©. 005 given : 5 д 
19100 2 of projections that can determine the object unambiguously. 


Owever, only 3/64, 3/64 
, , 4/ 
tually occur. Thus onlya al 


> Tes i 
fi К ресі 
faction of possible labi 


can occur in a scene 


Е Ее asons — 
t ; д ) They are related closely to surface features of polyhedral scenes. 


) They may be represented exactly; the noise and incomplete visual 
that may have affected the “line drawing extraction” can be modelled 
ompletely eliminated. 

ш) They present an interpretation problem that is significant but 


Explain the shape recognition by moments. 

5: Suppose f be a binary image function and suppose S = {(x, y) | f(x, y) 

present a 2-D shape. For each pair of nonnegative integers (j. k), the 
pt moment of S is given by 


OMy(S)- Xy 

(xy)eS б: 
00(S) is then just #5. Moment invariants are function of the digital 
that are invariant under certain shape transformation. For 2-D shape 
oh, we would like to have quantities that are moment invariants under 


tion, rotation, scaling and some kind of skewing. . 
The COG (center of gravity) (X,y) of S can be expressed in terms of 


Fi a 
18. 4.2 Different Views of Various C 


0.3. What 
br au do you understand by Jj, 


on defined as – 
d are often so easy t moments are е м0) 
ES te supplied” by оц У to supply as to pass unn. E _ _ Mio ind y= = 
У Our model-driven perception, `` noticed ih X ^ Ma Moo(S) 


Generalizin i . ` i 
E the line draw - Using the COG, we can define the central (j, к)® moment of S by 


BM: 
bik = e- x)Gy-Y) 
(x,y}eS 


x a SERE 
"Hes S 


Es 
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Th "m 

€ central moments are translation invari 
ant 

consisting labeling problem, 


»blem (CLP) arise in computer vision, in Al 


S* = {(х* Since if’ 
((x У") xt xp ys M 


bu PNCESS Oey 


hin Th = Hares _ ayes 
Moo(S) M g= х 


Similarly, У(5*) = ¥(S)+q 


an 
nt, namely, U, L, T and R, The first component U of the 4- 


nits U (1, os M} which are the objects to be labeled, The 
t L is the set of possible labels, The third component T is 
straint relation. T is an N-ary relation over the set U of units, 


x and 
Hj (S*) = Dix* x j 
-X(S*)J ~ 
? 4 берге (S*)] [y * -y(s$k | (uy, Yar 77 uy) belongs to T, then we say that units uj, иу ..,,, 
> je M X "M M | ^ strain one another. Groups of regions in an image thatare adani 
j ain P -[x(S) + pj/ (y*q- [ys е! line segment can potentially mutually constrain one another. 
) + q component R is called the unit-label constraint relation. R is an 


U x L of unit-label pairs. If an N-tuple [(uj, /)), (ш, 4), 
hen the units uj, uj, ...., Uy may be assigned the 
. Thus the elements of R are allowable labeling 
N subsets of unit set U. The only groups of units thatare constrained 
fuples of T, the unit constraint relation. A labeling of subset 


} of Uisa mapping f:Ü—L from Ü to L. A labeling fofa 


UN 


ye units is consistent if whenever uj, ш .... and uy are in 

uy) is in T, then Ku, f(u), (2. (02)),-- „бох f(x] Б 

oal of the consistent labeling problem is to find one or all consistent- 
the unitset U. 


E plain the following — 
i) The N-Queens problem (ii) The Latin-square puzzle. 
n be formulated as 


j= Eg ; 'h 
Moo Ans. (i) The N-Queens Problem — This problem ca 
N x N chessboard 


These momen 
ts are invarian 
transformations t under translati nt-labeli b is probl iven 
of th ation, sc 6 eling problem. In this problem, we are give! 
e form x* = px + — ie and in general, affine ard in such a way 
no two may be in 


| and “squeezing” The queens must be placed on the chessbo 
o sformation), Since if ееп can capture any other queen. This means that 
a У*)|х* z px +q, ys TOW, same column, or same diagonal of the chessboard. The N- 
a roblem can be modeled as a consistent-labeling problem in pes " 
board and the labe 
e will be exactly one 
h a queen is placed 


(1,2, 3, ..., N} is the set of rows on the chess 
o the unit constraint 


Alt normalized the coordi i 
Ü andthe 


normalized coordinates 


“Пу +t(x,y) eS}, E. 


then we have 


1, 2, 3, ....., N} is the set of columns. Since then 


y (a (yes 
=) Ex ү ^ ў row, a labeling will specify the column on whic 
у train опе another, s 


Re mj, (S*) = Geyer oy] | 
E. Mag (Sa | р 5, ow. Every pair of rows will cons Pee 
E эм У piix - х(5)]! гу. ut А ; um T will be the set T 2 (ui p Ej e U and u; * uj). The iiim 
«ба po, X) О. Т eT fe pM 
m à is : r (ву) (5) 2 } includes these pairs оё unitlabel pairs that represent vo eic 
к» E hessboard on which two ‘queens can stand without capturing each s ү 
. (li). The Latin-square Puzzle - This prob ens а s Edi 
N^ D ERE: i istent labeling problems. The Latin-square puzzle is an Nx N matrix WI 
E? i ] 
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N? objects that must be arranged on the Matrix, one 


e consider a 4 x 4 puzzle for ease of illustration, In this aig Square. W on of the back- 
Же " of four colours A = (pink, orange, blue, red} and has Si each Object; Wil search is shown 

4 {rectangle, triangle, square, circle}. The problem es Of four Shap 50, 
n2: ; such that each row, each column, and each of the two ш, ange the Ds of Back- 
9 main dia. Оч We denote, 


matrix contains exactly one object of each colo Bong, 
r and exactly One ls of the 


each shape. One way of modeling the Latin-square Puzzle ; bjeg а s 
squares of the matrix be the set of unit U = (1,2, |... 16} Th t the | pe unit-constraint Sree : 
the objects to be placed on the squares such as pink squar en the label, E, constraint 
red rectangle, orange rectangle, and so on. We can aptitesentT Orange trian bi 
product set L =A x B. The constraints are along the rows ^ the Cani 
diagonals, so we can model T as a quaternary constraint эе i and тай 
© Uy, 


f un 
| Uj, иу, из and uy all lie in the same row, column or diagonal) Th "2, Uy, wy) 
- The 


search (U, L, f, T, R); 


f 
b nae constraint relation R would then consist of unit- о: first (U); 
i Д 4 quadruples of unit | аре 0:15 > 
"TE form ([ur, (a1, by] luz, (аз, b2) [us, (as, bs )], [u4, (a4, bin wu "e п f ut ED); SIT tale 
МЕ h- из, U4) is in T. (ajbj) eL fori= I; 3, 4 and when i > gros Ma zu wy, kr E Ч Fig. 4.5 Backtracking Tree Search 
М , 591 j :Xh. ы ” . "m 
4 Q.7. Explain the backtracking tree searc. h. Also write м» *bj, each N-tuple (uj, ...., uy) in T containing component u and whose 
Lorithin, :omponents are all in domain (f) do 


Ans. A backtracking tree search Starts with the first unj 
сап i a unit of U. This yy 
= soon did each label in set L. Each of these potential е 
e 2 j e ыы The algorithm selects one of these ‘ie then are 

» Selects the second unit of U, and begi IE begin 

B U' =remainder(U); 
if isempty(U') 
then output (f^) 
else treesearch (U', L, f, T,R); 


uj, f'(u5)),......, (uw, f (и )))їв notin R 
then begin OK : = false, break end; 


ints. The process conti r 
ped Inues to level |U] of the 
path from the root node to any successful nodes at level JU] are the А 


consist i i 
sistent labelings, A simple digraph matching problem as shown in fig. 44, 


епа 


a subgraph of Ole 
for eu is eee is rua to graph G1. The possible labels end treesearch; 
© basis that indegree [f(u)] > indegree (и) and ' ; =a 
outdegree[f(u)] > outdegree(u) for all units u. gree (u) — ЗИРЕ питат | 
OJECTION, RAMMETRI 
3 e projection geometry. 


а .3. Write short note on perspectiv 
Ans. A pinhole camera is the simplest 
‘Imaging device which, however captures 
accurately the geometry of 

‘Projection. Rays of light enters doe | 
| infinitesimally small aperture. " 
‘through an infinite y ds 8 


а Pinhole 
perspective 
the camera 


i "the light rays with the à 
Itersection of the light ray: 


yrm the image of the object. 
‘om three dimensions onto tw 
"called perspective projection. 


o dimensions is 
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and let (a, b) be the coordinates of the Perspective projecti 
де image projection plane. Then а = x*/t* and P ifa c Е of 
с › Where 


0.9. Explain the one-dimensional Perspective project, 
Ans. The camera lens is at the origin and points direc ection, 


y down the 


E shown in fig. 4.7. In order to keep the image i iti i "axi, 
that the pd line is at a distance d in nt m + eine (e ien, ie. ae "rok й Nu _ i ==. 
A rojects forw: E a S and e ү); rojection Теа 
poo = жез (з s mE ers E Hp of left-right eo leng 7 fio o o 100 011 re to Lens 
- $ ў According to the geometric ray optics model for р as 10 the sed a doo 1l T E , 
- я i © pem the image line, which is a line parallel to н, M fic, ES 0 ИШТ 7 i 
Mise us A in front of the lens. The distance d is known ith шаа cab. 
е = on on the line is determined by where the line fr, aca ало 
ter "M s А ects the imageline.Hencethe ү, om (Mn) E 
perspective projection has coordinates (md/ rer на | 
n, d) in the original two dimensional | | 
Coordinate system. Relative to the one- = 
dimensional coordinate system of the image a z zo кс: 
— Zo 2—20 


line, the coordinate is md/n. Note that, both 
the numerator and the denominator of md/n a 
are linear combinations of m and n. If the ` 
aes and denominator were computed 
м appropriate linear transformation; the 
Pee oe Coordinates could be da ONE cii 
S ned БУ 2006 ratios of components of | Fig. 4.7 
ormed vecto; ES 


sider, a point whose perspective projection is (a, b) in the 
ystem of the image projection plane that is at a distance d in front 
ens has 3D coordinates (a, b, d). Since the camera lens is the 
sing through the point (a, b, d) and the origin consists of all 
of (a, b, d). Furthermore, since the origin is the center of perspective 


on (a, b). We call the line L, 


era 1 


: х х а 
s т= 41у у= МЫ 
33 РА 2 d 


i 

зе perspective projection of the point (a, b). 

12, Write short note on photogrammetric terminology. 

Basic terminology used in photogrammetric is are as follows — | 

_ (i) Exterior Orientation — The exterior orientation of a un n 

ed by all the parameters that determine the pose of the oe see 
ference frame. The parameters consist of the position 0 - s Mrd 

i pectivity and the direction of the optical axis. pde p E on 

tation therefore requires three rotation angles an ы 

fers and it is accomplished by obtaining the 3D coo mia 

| points whose corresponding position on the image в ia E: ai 

(li) Interior Orientation — The integrior piden кше 
Pecified by all the parameters that determine the n Miei 
ее dimension rays from the measured image coordinates. 


s afa] [1.0.9] [1:07 o pig Зз 


" Е ive tra ee 
пе шшен coordinate system that assumes an 2118190015 done by 
e lens.-Let (x, у, z) be the original.coordinates:of a point iid position а 
space. Let (Xo, yo, zo) be the position of the lens, called S dimension 
Re center of 
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of interior orientation relate the geome 


try of ideal Perspective p... Unit- у 433 
Е physics ofa camera. The parameters include the camera сопь ton fon ch the point where the similarity 
б a point and the specification of the lens distortion, Complete speci еі Е E мааа Matching methods such 
"T AT : orientation of a camera is given by the interior and exterior Mei F thy КЕ БОД and so c Although тео, conditional entropy 
EI к, ч (iii) Relative Orientation — The relative orientati tons, d ists some shortcom; el matching is the 
| ( \ entation of опе c , there ex | ortcomings such as huge computation 
relative to another constitutes a stereo model and is Specified by five s... en f matching and sensitivity to rotating, scaling and dee , 

— three rotation angles and two translations. When two Cameras Parameter S ) ` 

: orientation, each pai 


The common image feature includes 
Closed area, statistica] moment, etc, 
ture extraction algorithm can be divided into three main classes — 
in t feature extraction operator such as Forstner operator, Harris 
nd Susan operator, the second is linear feature extraction operator 

ny operator, LoG operator), and the third is surface extraction 
y through region segmentation, Generally speaking, feature- 
hing has the advantage of being simple to operate, rapid matching 
high precise matching rate, but it also requires human intervention 
ining of feature points is a bit difficult. Besides, it is only suitable 
" images with significant geometric features. 


orientation of each camera is known. 


ЕР бру | ; ee RS н Dis. bout the intensity based matching techniques in 1-D signal. 
| e IMAGE MATCHING — INTENSIT Y MATCHING OF 1D s aa ^ x unita directly refer to the model equation 
L... SIGNALS, MATCHING OF 2D IMAGE, HIERARCHICAL Mig based: 

Ue А na 5 — IMAGE MATCHING > 


a EIEE 


gt 3 (0) 
Bic’) = Ti {g"[Ta (r^, c pg Epi) ЕЕ 
estimating and evaluating the parameters PG and р. To ane 
ie principles, we first develop methods for matching ID e : onis 
! is demanding when taking the statistical properties m же 4 
we restrict this discussion to the case in which one o M. еи 
о be perfectly known. This situation is relevant to an obj 
"dure. The model then can be written as | " 

P BG) Т.о Сузр р) + nG 

t F 3 А | = replaced by g, f, x and y respectively, 
CM d explicitly. Let f(y) is 
| e that allows one to 
st derivatives 


Q.13. Explain about the image matching, 
Ans. Image matching is 


an essential and difficult task in digital 


ё observational noise component n(x) is ш 

у sampled data and a fitting or кор а 

ived or estimated continuous у) acus ар А 
te i is is just like the Bonnes tie 

E lytically. This is Just. SS mU 

NM ar the estimation problem, we alway: 

L] 

Br. 


xtural images. 
' A lot of efforts have been devoted in the fj 


prove the reliability, automati 
matching which can be generally divided into two cl 


primitives. One is area-based matching and th 

(@ Area-based Matching — Are 
directly on local image windows, and it can 
uses the grey value ofthe whole image to m 


ИО Values p® and PQ are known from some i a OK 
ГОХ: v С ; г 

JM TI \ initial approximation permits we iss rise to better 
боп scheme. This blem whose solution then gives rise eee 
em by a linear substitute pro [у arrive at an iterative solutio 

yr is wav we naturally a . er effects are 

Ms. In aita dee always assume that second-ord 
Minear problem. We th 


otespondences. It 
ШУ Of two images 
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sufficiently small specifically, (i) when interpola 
neglecting the random nature of f if it is derive 
deriving variance for the estimated parameters 


Unit - Iv 135 
E angulation. The SIFT algorithm is proved to be able to Produce robust 
ji sparse corresponding points invariant to moderate seale chan v 
which is ideal for the purpose of generating a certain number of wel 
matching points for the initial triangulation, In the SIFT dotium 
tpoint is characterized by a vector with 128 unsigned eight-bit numbers 


ating for its derivat, 
from real data, o, | 


» Or 


Q.15. Write short note on epipolar geometry, 
Ans. Image matching can be tremendo 


sly simplifieq if the 


e. orientation known as the two-diamensi М ; i i И 
т Sii бае by the so called ae. кзы Space is reduceq шө ей from a local region, which defines the multi-scale gradient orientation 
dimens nal r etry inherent į a ong pes? tching is performed by measuring the similari ee 
1 » be Seal setup of cameras is shown in fig. 4.8 m2 Orienteg E ане with he two ditus Mele лл 
к centers A' and A" form the baseline of length b, the princinay ^ Projectig E i li 
> Principal poin n RANSAC approach is used to detect and eliminate possible blunders 


are assumed to be the origin of the two image coordinate Systems ( B амр 
u'y’ 


(u",v") derived from the pi 1 ' 

| уе: айбар pixel coordinates (r',c’) and (т” or »V) ang 

| interior orientation. The object point P(x (r >C ) by using the 
I P EOM IN * > У, z) then is 

s | (ш, v) and P"(u", у") in the image planes p' and y" 15 rapped into 

: | H geometric model of the perspective projection — speci fically oo Of the 
f 


e. previous SIFT matching results. It starts by randomly selecting a 
ofthe matched corresponding points. From the chosen matched points, 
ental matrix can be calculated based on which a model is then built. 
i "model is evaluated by determining whether each pair of corresponding 
Я reasonably well to it. This is used as a criterion to determine the best 
model hich has the largest number of correct corresponding points. This 
process is repeated to find the best model. Those matched points which do 
{fit for the final best model are considered as blunders and eliminated from 


al matching point set. 


e collinearity 


condition — i ‘AUD > 
оп — the five points P, A ,A",P' and p" lie in one Plane, the so cal} d 
d e 


А epipolar plane є (Р) associated with Р. The intersection 1 


E í baseline b - (A'A"). The epipolar he. ofp lanes p assing through the the seed points are extracted at the top level, an initial triangulation 
. і which are the intersection of the baseline b ни is E e F' and F, constructed and an area-based image matching with feature points and 
W апі p”, ivel д : cts wi e image planes oints i ducted at the top level again. : 
| respectively Thus in general epipolar lines are not parallel. Eo om s E 
f P 3 (x5 Y» x) 
| z^ 
хору! m 


е 


В (х), Y2) 


| | 7 Шах using TIN Surface 
Fig. 4.9 Interpolation of x Para edd 


. ; tc 
"Inthe process of hierarchical strategy, image та to the next 
the Deut resolution. The matched жс өтөр, УЗЕ gem be matched. 
tvel (of higher resolution) where additiona nal image level. At а 


Bee to the origi sic 
NM Cte repeals TM qa dare matched again to achieve higher 


р F ints fi m upper i 
preci: jon. ue Кариш | regular Network) surface of parallaxes 15 
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generated from these matched points using the Dela 


Ы i i unay triangular; 
i TIN i "a liia the correspondence of additional feste О, Thi, is method finds points of high curvature iris ves 
s shownin fig. 4.9, a point X(x, y) is insi Point, i is the med; s uncary, 
Р : Е (х,у) isinside ofa Delaunay triangle h E ми e el median point ina sequence of 
by points A(x}, ¥1),B(X2,¥2),C(x3,y3), the X parallax is į Omeg <À wane An | а pU1= 1, п, аге all local maxima. In 
j ; nterpolate dh jt detects not only very sharp corners but also points of high curvature 
» onthe TIN surface formed by A (XL yr Pj) B'(x5, Узр, Сүх ased e boundary that are part ofa section of approximately constant curvature 
2 3 Узр), (ii) ‘The Chain-code Representation — Refer to Q.2, Unit-2 | 


(iii) A Sequence of Line Segments Representation — This is a third 
representation for the boundary of a 2D shape. This Tepresentation is 
used after the original sequence of boundary points has been segmented 
f line segments representing near-linear portions of the boundary. 
equence of line segments has been computed by some method, it can 


The x parallax of point X(p,) is interpolated as the x Coordinate of 
© of point X 


which is the intersection between the 3-D 'B'Œ 
t plane A'B'C and the line Xy 


parallel to z-axis. Parallax in y direction (p) 
Р сап Бе са] і 
strategy, only using у parallax as z abordingtes. Fda, Pra, “am 


; ; г the esti 
corresponding point coordinate (x', y^) is defined as — ы. 


х' =х+ . 
_ Рх Р A model for representing the matching sequences of line 
y Sy py Е [s was introduced by Davis. According to Davis, a line segment sequence 


ence of junction points (Xi, Yia) where a pair of lines meet at 
location (X;, Y;) with angle magnitude aj. Given a sequence A= Aj, 
of junction points representing the boundary of a model object A and 
equence B = By, B; ....B,, representing the boundary of a test object 
oal is to find an association E = (1, 2, ...., m) => (1,2, ...., n} U {missing} 
es i<j = E (i) <E (j) or either E(i) = missing or E(j) = missing. Davis 
[constraints on both sides (line segments) and angles to define what is 

га best mapping for this problem. Let Мі, j) be a local evaluation 
on that measures the goodness of the match of junction i of A to junction 


ased on the difference between the angles о; and oj. Let Si(i,j) bea 
: of the consistency of mapping junction i to junction i’ and junction j to 
ў', based on the difference between the segment lengths of B;B; and 


Es iex 
| OBJECT MODELS AND MATCHING — 2D REPR єс, 
b phun, MATCHING — 2D REPRE: | 


The cost of a mapping E is given by — 
mm 
ct = X Mt EG) + YY Si [EG EG? Pmp) + (тд) 
i=l 


i i=l jel 


ere, P is a penalty function for missing angles. 


~ Q.18. Explain the following terms — 

(1) Global feature (ii) Local feature. | 
‘Ans, (i) Global Feature – A 2D object can be thought of as man 
nage, The pixels of the object have value 1, and the pixels outside = jec 
have value 0. Because of this relationship, it is natural to represent by using 


me of the same features we used t 


о represent binary images. Commonly , 
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used features for 2D shape representation include а 
circularity, and elongation. = Perimeter 
(ii) Local Feature — A 2D object can also b hs 
local features, their attributes, and their interrelationshj à 
used local features in industria? part recognition are num : 
can be detected by a connected component Не. 
tracing or, if the shapes of the holes аге known in ы 
operations of binary mathematical morphology. Local oa 
into some type of structure for matching. The most co У 
is a graph whose nodes represent local features aa 
measurement and whose edges represent relationships a 


Men, 
2) 


Charact z 
eriz 
The Most а yi 


Co 
San Ting, 
: Comers, D 
E 


anc 2 Sound, 
ӨШ е, through ^s 
Ed = һе Оры 
thej d Struct 

1 Droperi. т 
mong the feature” 


TION, CONTROL STRATEGIES, INFORMATION 
PAIN, ЧОЕК 5 M 
TROL STRATESIES ATION 


VE 


Fig. 5.1 shows the typical architecture of 
-based Computer Vision (KBCV) нано 


gh-level processing. At low-level Visual Knowledge 


| ] and hi 
Model 


processing, the raw-data is processed by signal 
sing algorithms, usually generating feature 
ions, sets of symbolic desciptors which 
izes characteristics of data in a quantitative 
ples of low-level processing algorithms 


nm 


+ . . " 
mentation, color detection, texture detection, Low-level 
Processing 


duction, occlusion detection and so forth. 
gorithms were the main focus of research Fig. 5.1 Typical 
yuter vision for a long time and had come а Architecture of KBCV 
y in terms of performance. 

n the other hand, high-level processing is re 


lated to the interpretation 
n top of the low-level 
put and generating 
sual data. This is 
g can also act on 


soning with visual data. It is usually built o 


Ssing algorithms, taking features descriptors as inpu 
qualitative descriptions about the content of the vi 
the high-level processi 


| content descriptions. Ideally, | 
W-level processing adjusting its parameters to improved me ВА 
n generated content descriptions, creating à kind of feedback оор 

up and top-down reasoning). 


ually called bottom- | 
pie implemented using the various forms of 


"3 


У есп the two levels (us 
ugh-level processing has 
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symbolic artificial intelligence, from rule-based to prob. 


ТОП abilist; a : f knowledge can be expressed with a different rule. A 
usually employ some sort of a priori knowledge about the visu ет, The ch piece an contain hundreds of production rules. It is very easy to 
to be made. erret r remove а production rule in a knowledge base. This allows to 

yo 


0.2. Explain in detail about knowledge representar; l dily tea knowledge base. Production rules can be grouped in different 
ation, 1 e knowledge and help the reasoning. 


ower of expression of production rules relies on the kind of logic 
bor For rules based on propositional logic symbols represent all 
н everything is a constant. It was the case in the previous 


Ans. A knowledge representation is a set of Syntactic а 
conventions to describe a piece of knowledge. In order to buil e Sem 
bases the experts describe the contents of their knowledge u "hls 
representation schemes. A knowledge representation scheme my Лоу], 
explicit what is important and must be easy to handle. Two main dae ty 

е ge 


representation schemes are presented in the followir 
hg, producti 
fines. 8, production rules and 


аш 


1 
Sing Edge 


0 


Aes used; the values can be referenced. The rule R2 shows ап example 
j r production rule, where the value of the temperature in the room is 
] А ee i h the attribute room- 
(i Production Rules — Production rules are ctly given in the rule but referenced throug 

predicate logic and the inference capabilities of modus 
tule very generally expresses an “if-then” relationshi 


production rules are as follows — 


directly inspired from 
ponens. A Production 
р. General syntax {у 


ure. | 
ple of 0+ order production rules are given below — 


d omment : “use the heater if the temperature is too low 

comment : *a co а я: 

if mment explaining the rule”, room-temperature < 19 
condition | 

change heating-status to on. | i | 

and/or or rules based. on first-order logic or predicate logic quantifiers and 
ш les сап be used. The rule R3 shows an example of first-order production 

y | iere two variables M and N and one quantifier, 3, are used. 
conclusion whe 


ple of first-order production rules are given as — 


Dre means that if a set of conditions are true then the Bolo 
. FOr instance in the domain of the bridge game, if we want to expres 


that during bidding one can o i i 
pen with a specific distributi 
can use the production rule Rl bridge – De 


E 
E comment : “track mobile regions which are possible human 


df 
3 mobile object M 
and 


Rule Ri bridge ine shape of M = human 
comment : “bidding rule for Opening” and tres 
if 4 d 1.4 metres < size of M < 2 metre 
balanced hand t "then А А 
апі f M track M on rules are simple, in particular in 


n say that producti 
1 logic; they are ve 
f a knowledge bas 
contrary they hav 
e uniform formalism 
r problem solving. f 


12 to 15 head points а: 
then е ж НИ 


орепїпр 1 no trump M 
This rule means that if we have a balanced hand ad 
- head points then we can-open at the level of 1:no trump, 


In conclusion we cà 
@ case of propositiona 
ow fast modifications O 
ovide to a user. On the 


i d, th 
owledge is fragmented, 
s to А lack of efficiency fo 


ry modular and readable. They 
and explanations are easy to 
e different drawbacks — the 
for expressing the knowledge 


between 12 and 15 


me 


VR r 
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(ii) Frames —A frame is a knowledge representatio, 
comes from cognitive research on human reasoning. The h m 
human beings refer when reasoning to prototypes already bd о 
and compare them to objects or events corresponding to Bs. | 

The concept of frames is a declarative knowledge represe и 
well adapted for structured Р DM 
object descriptions. 

A frame is a set of 
attributes (or properties). 
Each attribute has several 
slots which describe the 
characteristics of the 
attribute. The attributes are 
very dependent on the nature 
of object it represents. The 
slots are predefined general 


тем 
SIS is thay 
n Memory 
ations, 


lon Scheme l 


Table 5.1 The General 
Frame Frame infers values for parameters, and then tests to verify 


Attributes 


type | type-value 
value .. ` | -attribute-valye 
default default-value 
‘type т type-value 


value value 


f i lates an importa 
each of which encapsu ajor asp 
jem. Then each subproblem is successively divided into more | 
^ 7 B "n | . 
i problems. The refinement continues until the most refined 


can be solved directly. 
egy — This strategy is used for problem 


1 Strat R 
Blackboard Contro « of the inference mechanism ; 


range-of-values | range-value 
hich the various component 


characteristics useful for any | type type-value mm 

à - à i torage area B 

kind of attribute; classical value attribute-value with one another шша A h s uis emi one f 

slots are the | the blackboard has su : 
type, the current range-of-values | range-value lackboard. When ke a deduction, the inference i 


Е : 1 а А 
value, a default value, а of the inference mechanism to m: кн blackboard where it ү 


ossible range default default-value writes its result к í 
Framel is эже. ve 3 uum othe components of the inference mechanism. In 1 
P ‘ample о vailable for à ssively propagated an d the required j 

BA ERE врба, апа type-value the inferred constraints are succe Е 
4 kind of slots. A frame is sans fermen de more limited. БЕЙ 
useful to describe a general attribu ys h to the information integration problem has 
concept, as the concept of a rangeof-value | default-value 4 ich approac 
chan, default default-value UM information integration problem i 


i th 
ae Servisi m dee n approach to informauon ica 
T о , $ 
а in 1987. Pearl defines а Bayesian ber nee 
cyclic graph whose nodes represe des of ts graph are 
j ^ jable A; has а finite set of 
Ap A2 cen Ам | AL ict 
е values {а}. An are (Aj Ai i Р 
ГА. is a direct cause of the Y 


Describe various image control Strategies. 
ns. $ ies in i 
ome control strategies in image processing are as follows — 


i) Bottom- 
solving Ss is н ~ This strategy is used for problen 
only uses general pole кы object models in its early stages 22 
vision system using a bottom-up шо pe being sensed. In a compute 
interpreted and aggregated. The ems 
successively manipulated and aggregated unti 
description of the scene has been generated. 


the observed image data P 
and aggregations are thé? 
1 a sufficiently high lev?! 


(ii) Top-down Control Strate; i 

| | à 'gy — This strategy j 

solving that is goal-directed or expectation Pues E used for p wt 
generated or hypothesized. Assuming the hypothesis is ab Кот 
information іп the knowledge data base, the inference mech 


he support set of a node is 
node A; has, support set 81, | 


true and using 1 
anism then infers, if 


di 
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ieee) of Aj= aj, given the set of value {s;} for th | 
istribution corresponding to the entire network is соон Vatiab| a= оуу ayex@ LPC \с,&)т д (nA (d) 
п es. Th AST c,d 
P(2,,22,.....85) = ; [s r 
bå2» ân) Ie; Isi) nstant that makes YB)-1. 
The matrices of strengths on Е | z 
й ach of the : : : 

the network. As information propagates causal links are static ; s смс | 5 | 

input t GH TRANSFORMS AND 
NITION METHODS, 
HAPE MATCHING 


through the network, the belief value 
B(a) of each proposition A = a changes 
The dynamically changing belief value 


Desk, 

Workstation 

What is object recognition ? And its main components. 
in the field of computer vision. 


B(a) denotes the probability P(ale) of A 
=a, given all the evid Sait Teleph 
far. The idea for a poa of xa E MEME mid "Ob t recognition is а technology 
a belie Key Board ’ ject те 
T recognition system finds objects in the real world from an image of 
dels which are known as priori. This task is 
sly and 


Wh rli using object mo 
ere, nodes M, N and O re ; difficult. Humans perform object rec 
STI , те ] 1 * а . . 

E E possible labels — object M раа bis identified. Each objet y. Algorithm description of this tas 
e set of a workstation or a desk, obj | f i i t steps in object reco ition and 
d med ш or keyboard, and object О n or a desk, object N oN І difficult. The раш = a аА e 
al. Labels of *desk" for can be a telephone ora some techniques that have been use or o e AM jon int y 
. The architecture and main components of O ject recognition 1s 


ognition effort les 


network to be used in im: gi F ortion o 
Гая: 4 
age analysis 1S ig. 5.2Р, 7 of a Belief Netw on on 
0i m 
k for imple entati 


shown in fig. 5.2. | 
rk e 


M and "pushb 
uttons" for N t 
support the label of “teleph tend to 54. 
and labels of ер опе” for О, apa 
“keyboard” workstation for M and Feature Features | Hypothesis Candidate Hypothesis Object 
label icd tor N tend to support the Detector Formation | Objects Verification | Class 
belief rminal " for O. The model of 
the PU жөнү defines how beliefs in 
ork are updated as n i 
enters the ew evidence Modelbase 
system. The idea for a T 
Fig. 5.4 Architecture of Object Recognition : КЕ 
els known to the system. 


ins all the mod 
on the approach used for the 


propagation of beliefs thr 
ough a pi 2 
piece Fig. 5.3 Propagation of Beliefs 
ription to precise 


of a belief network i i 
is shown in fig. 5.3. through Piece of Beli of Ni h 
a Belief Networ 


Here the nodes C 
and 
are successors of node A D are predecessors of node A and nodes W and X 
a es W an 


tion in the model database depends 
itati nctional desc 


ti ma 
on. It can vary fro 5 of objects are 


ce informatio 
A feature is some attribute of the object that is 
izi ject in relation to 


ribing and reco 
8 used features. 


Let P(a | c, d) be th 
€ fixed conditional probability matri ha 
x that relates the 


variable ‘a’ to its i А 
d» immediate parents c and d. Let т ( 

ofthe causal support contributed by an inc ‘A(c) be the current strength d shape are 50 aes 
сей : oming li - and identifies ocations 
from A. ied e" diagnostic support nd A, Let Ay (a) be t^ БИШЕ pipi The features used by 8 
нег children n ah d ШОО by an outgoing link poH som to be recognized and the organization 
ча эшн d as diagnostic support represe: 8 forward from parents ) ted features in the image, the hypothesizer 
. Updating a node A thus invol nts feedback fr ild ) cece :« step is used to reduce 

Ves updating, om children : eae piects pre . ne. This steP A 
n signs likelihoods to 0 jects рг н del base is 
r the recognize limination of unlikely 


he search space fo 


g not only its belief 
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object candidates from 


е ossible ; 
models to verify th p considerat; 
e hypothe Tation, Th 
system then selects the des ae refines the lie tier th Unit-V 147 
e А Н e S T 
ше T the correct object. the highest е 090 of gt Obj Text Recognition — Text recognition deals with recognizing 
All object recognition systems 900, азыу В, А pols, individual words and series of words. Ex- recognizin 
p feature detectors based on ke models either ех on al ^ gofa person. B 
and vi . > ese obj plic: е d P 
kei TS components vary in their Ject models, The h tly or impli Hand Gesture Recognition — Hand gesture recognition deals 
ject with hi Tal y hypoth erent ati 2. 
approaches are a xem a od as the correct ee sormation Bur CN уоп Also wrila is панно 
e of thi ect, 4 : 
Systems, on the other hand, r i of this approach, Ma P atten hae БУ yl Hough patented the technique of Hough transform in 1962 isa 
work in the verification piis 1 "i on the hypothesis 2; artificial in ction method that can be used in image analysis and digital image 
matching, bypasses the iypofhes eus one ofthe classicale ation and toe Оте aim of this technique is to produce a computer vision system 
Ok Wisi Is formation stage entire] PProaches, tae ect arbitrary shapes within a sample image. The main purpose of 
c are the challenges faced in obj у. Plate 115 finding imperfect instances of objects within a certain class of 
e challenges faced in obj ject recognition » oting procedure. The classical Hough transform was mainly 
i i the identification of lines in images, but later the Hough transform 


positions of arbitrary shapes 


(1) Change in si iti ; 
ge in size, ; Ion are as 2 
follows — m odified and extended to identify the 
ion indulged itself in finding 


most commonly the extended vers 


might change b & uracy of the $ ound are s 
y scaling the im ystem. The асси OME of the 
йв (ii) Adjusting bri ls Tacy of the model In that case appropriate parametric representation is needed. 
p. ifficult for the system to a ess and contrast of the im are a wide range of areas where the Hough transform can 
|. f йг (ili) There may be Cognize the objects in the rks © may also make jt ted successfully such as in medical visualization or in order to 
f or the system t cases when the obj ge accuracy in face recognition etc. The characteristics of Pupil 
& Lus О recognize j € object mi T ee твор . | up 
d these ca gnize it. The object att = not be visible enough r uncontrolled illumination can also be obtained by Hough circle 
10n system must handle objective spinal motion imaging assessment system (OSMIA), 
the positions of 


Ses of low Visibility, 


(iv) The s 
| А ystem 
| groups and are too small nds 


to locate marker that can be used in determining 
bodies. The measurement of vertebral motion has been a challenge 
r many years but now several automatic 


fail in Ses whe occur in 
Ca: imi 
h Te similar objects C i 


MP T i 
M k (v) Vario d z Hes P 
p г 54 Pose difficulty for a lightning conditions and h ; ‘of biomechanics fo 
š System to recognize th shadows in the image may also to these problems have been developed. The Hough transform has 
e object ntroduced in morphological image processing to detect nad estimate 


er of red blood cells in the blood sample image. 
9st common application of using the Hough transform is the linear 
tor detecting straight lines in an image. In the image space, the straight 
e described as y = mx + c where the parameter m represents slope of 


and c represents the y-intercept. This form of representation is called 
ht line. The main idea behind using Hough 


lope-intercept model of a straigl 
rm is considering the features of the straight line not as discrete image 
rms of slope-intercept model. In general, 


(Xi, y» (х2, уз), etc., but in te С 
ght line у = mx + с can be denoted as a point (m, c) in the parameter 
where m represents the slope of the line and c represents the intercept. 

Advantages and Disadvantages — The major advantage of Hough transform 
at the pixels present on one line need not be adjacent to one another which 
exceptionally valuable when attempting to perceive lines with tiny breaks 
em due to noise, or identification in partially occluded images. 
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With respect to hindrances of Ho 


ке LS ы associated а е it may if es in blemished image. In the Hough parameter 

values, as opposed to а a infinite lines indicate, Ruy | Dune ccc зи СЕ 
0.9. Discus. : with definite termination ^d by беу ty d apta А К ; 

s about the line Hough tran shin Points, (m, ) Ға point іп the x-y plane to the parameter space may be 

т. as the pasic behind the general Hough transform. The parameter 


Ans, A suitable equation for describing 


X cos0 + y sing = S nbus... d by space of the object of interest. 
r Metric form : 
is. 


is difficult to represent in parameter space, contrasted with the 


rmal from the oriai 
: 1Z1N to this |; form: : i 
b X-axis. A suitable line ang 0 ig е easily. The equation of the circle is — 
А t 
ig. 5.5 which shows y of describing às г2= (x- ш) +(у- n 


are three parameters of circle, where (m, n) is centre of the 


an object 
metric Tepresentation ae азы eT andr 
) А à р 2 
ў ine, vein the direction (x, y) respectively and r is its radius. 
Circle can be represented in parametric form as — 


х= т +гсоѕ Ө 
у= п +геіп Ө 
A Along these lines the circle’s parameter space shall fit in with three 
‘mensional, though the line just had a place with two dimensional. As the 


у d 
quantity of parameter expected to portray the shape increment and in addition 
the measurement of the parameter space expand, simultaneously the Hough 


‚у 
у too increases. Therefore straightforward shape with 


Fig. 5.5 ; 
Gradients Of the Example transform complexit 


TN | Object Fig. 5.6 Parametric Representati parameter fitting in with two dimensional or at most three dimensional. The 
known €n an image is analyzed th of a Line тшп parametric representation of the circle, the range can be held constant or a 
pixels th e (x, Y) paramet fixed number of radii can be set. 


at are selected for 
At every edge-point we draw a circle with centre in the point with the 


craved range. The drawn circle attracts the parameter space, such that our x- 
aris, y-axis, and z-axis are m-value, n-value, and the radii respectively. 
Accumulator matrix has identical size as parameter space. Value in our 
accumulator matrix is incremented. Hence, we clear over vitality edge-point in 
the information image drawing circle with the wanted circle with preferred 

i increasing and updating the value in our accumulator. Now, when each 
dge-point and each preferred radius is utilized, then we can get the number of 
circles passing through the individual coordinate stored at the accumulator, 
lence the highest number relate to the centre of the circle in the image. 


— Q.11. Explain in brief about the patterns and pattern classes. | 
4 3 Ans. A pattern is an arrangement of descriptors. The name feature 1S 
d often in the pattern recognition literature to denote a descriptor. A a 
Class is a family of patterns that share a set of common properties. an 
“Classes are denoted pj, Pz» ~» PN» where N is the number — en 
| cognition by machine involves techniques for assigning ie жЕ т 
| Tespective classes automatically and with as little human inte 
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possible. The two principal pattern arra 


i : (for quantitative descripti ngement 

М ptions) а : 5 use E. А 

- і vectors are represented by vin Mia (for structum Practj [ is the length (number of symbols) in the string representation 

i the n x 1 vector form. Wercase letters like descrip." 2 At. It can be shown that p = 0 if and only if x and y are identical. 
X y ang ; Pat, “je measure of similarity between x and y is the ratio 


Т В = ih ed e 
p ^ maxixLlyD-o ~(i) 


about the affine matching. 


there are two main approaches proposed 
sis and test. It 


Jn affine transformation, 
о match objects. The first alignment, also called hypothe: 


computes an affine transformation based on an hypothesized correspondence 
1 basis and then verifies the hypothesis by 


Ans. The decisi 
Ба 1810п theoreti 
of decision function, Let ar approaches to recogniti 
IB 


vector. Fo Xp, ..., X,)T 
feats rN pattern classes p п) represent an n-dimensi 
ic recognition is to find b Р» .-› Py, the basic problem 


Onal pattem 


the pri ; N decisi d i н “е 
Property that, if a pattem x Mace function dj (x), Фк), . E decisim between an object and mode 

dix)» d(x) о class рь then > ONC) wit tansforming the model to image coordinates and determining the fraction of 

model and image points brought into correspondence. This is taken as a measure 


uality of the transformation. Suppose an image I containing n feature 


ts, alignment consists of the following steps — 
ы for each model M. Let m be the number of model points 
15 for each triple of model points do 


' for each triple of image points do 
hypothesize that they are in correspondence and 
compute the affine transformation based on this correspondence. 


for each of the remaining m — 3 model points do 


apply that transformation 
Find correspondences between the transformed 


1 X) Or, H from „ IS oF 
equivalently, by values RS iln €: 


Ans. Strin Structural 
8S are the m methods, 
Рроѕе that two кү: approach 6 i model points and the image points. 
Xn and y, yp... у n boundaries xand in structural pattern “Measure the quality of the transformation oe | 
Б Уаге coded into strings (based on the number of model points that are paired with image points.) 


all possible groups of three model and image 


points, since it is known that they uniquely determine an affine transformation. 
The second method, geometric hashing or indexing isa table lookup method. 
It consists of representing each model object by storing information-invariant 


een the two зх." SPeCtive] 
Ti н VA 
DES Where a Let a represent the number 


bols that do h Occurs ; 
M NOt match C? іп the k position 
B= mal iy) gn p 


These steps are repeated for 
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information about it in a hash. This table i 


а : # оу i ? S co " В 
time, similar invariants аге extracted from the < ion — Insert а symbol ‘a’ into a string, denoted as À > a 
1 the table to find Possible instance of the model tl hie (0 Inserto bol 
^ ' : lo as geometric hashing for point set mate} 1€ ind 1s the null sym 0 | 
invariant. The coordinates оГа Point into a ref, e here р Deletion — Delete a symbol fom SER SPOSE аан о 
^ ‚ noncollinear points are affine inde a erence fr. l A Change — Change one symbol into another, denoted as a — b. 
| : Preprocessing phase and а recognition phase (i ive real cost function d(a — b) is assigned to each edit operation 
i mm 
Algorithm of Preprocessing Phase — y 


he cost of a sequence of edit operations that transforms X into Y is 
; [е um of the costs of the individual operations. The edit distance 
жы given p B iefined as the minimum of such total costs. Let E(i, j) be the 
м BO 2 cen the substrings ag, ....., а; and bos м» by. It is E(n, m) = 
Let E(0, 0) = 0; then EG, j), 0<i<n,0<j<m is given by 
E(i-1,j) * e(a; >A) 
E(i, j) = min 4 E(i, j-1)+e(A > bi) 
E(i-1,j-1)* e(a; > bi) 
iid ‘The matching problem can be seen as one of finding an optimal 
un ES asing path in the 2D table E(i, j) from the entry (0,0) to iia p" m 
"hen hie elements of the table are computed horizontally from each row to the 
наар jja Н Л E. ren computing E(i, j) the values that are needed have already 


for each model M. Let m be the 


for each triple of noncollinear model points d 
form a basis (reference frame) i d 
for each of the m-3 ini К 


determine the Point coord 


number of model M 


Recognition Phase — 


0.16. Write short note on shape matching. 
Ans. The first stage aligns the input shapes in the common 2 pie 
orresponding parts on different shapes can be easily же A is 
stage сап be divided into a global phase and a local phase. In the globa p Ee 
fe jointly compute an affine transformation Ti for raise 5 | кы “ч 
all shapes are roughly aligned | : с e cr - pon : жле 
" matching multiple shapes, st S р 
7 еуин кош a similarity graph G ämong the apes 
s along with associated relative transformations Тү, р (i, » n ki 
nd step jointly computes an affine transformation T; for each shap 


N n -l d 
an the induced transformations Tj oT; an 
izi sonsistency between the induce 
neon mizing the consistency 


OF concavity are consid 
' sidered, Т i 
matching problem, The String i thee Sem 


tions Tq; j Among existing formulations to this problem, 
‘ormulation, due to its ability to handle noisy relative 


Telative transforma 


x 


chi ; extend the MRF f arn ion reli ffectively sampling 
Let X = M iiv el algorithm is as follow sformations. The elficiency of this iq eres tei we nodes a 
redit n н! E W . fon space of each shape. To address this » ме нее 
dete es of edi Operations, Mp. 2» Dmi be two Strings of symbols e transformation space of oN wh is sufficient to provide an initial 
efined to transform X into Y У. insertion, detectio symbols. Teduced affine transformation в 


‘hich enables us to perform the MRF 
п, and change, are tarting point for the local phase, and which enables us to p 
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zero-mean and white, so that E(yyl } =I, and n is zero- 


is now easy to formulate the likelihood function 
an whi 1 . É ; А 

is Gaussian. Scaled eigenvectors of C, are 
m likelihood solution, in the limiting 


optimization for each type of 1D 

z і trans i 
plane) in a sequential manner. In this ЕНЕ, о 
per-shape in each subproblem. j 


*B. the р 
» WE only need to 


tation ; 

iri Ih the 
еа Ip NC 
acg 


In the local phase, we 
| ; ме proceed to joint] 
for each shape S; to improve ie ali 


Y optimize a free-fi 


gnment . Tom d 
- To avoid can 
ап 


Mation 


sis (FA). It is ca 
is is different from PCA. Factor analysis was originally developed 


ces and psychology. The model has the form of, with the 
tion that the elements of y are the unobservable factors. The elements 
Aare called factor loadings. The elements of the additive term n 
led specific factors, instead of noise. Let us make the simplifying 
jon that the data has been normalized to zero mean. 


| we assume that the elements of y (the factors) are uncorrelated and 


‘A 
and their variances can be absorbed into the unknown matrix A so 


incipal co we can assume 
m E - 
dimensional orthogonal vectors method) sen у, di mo. Efyy™} =I ...(ii) 
> TC. D Р 

es for k n. ents of n are uncorrelated with each other and also with the 
Q=E (nn). It is a diagonal matrix, but the variances of the 
generally not assumed to be equal or infinitely small, as in 
cipal FA. We can write the covariance matrix of the 


cien 


>й 
w 


elem 
; denote 
ements are 
al case of prin 
ons from equation (i) as 


(iii) 


n practice, we have a good estimate of C, available, given by the sample 
ince matrix. The main problem is then to solve the matrix A of factor 


5 and the diagonal noise covariance matrix Q such that they will explain 
served covariances from equation (iii). There is no closed-form analytic 


tion for A and Q. 
Assuming Q is kno 


E(xxT) = C, =AAT+Q 


wn or can be estimated, we can attempt to solve A 
AAT = C, - ©. The number of factors is usually constrained to be 
much smaller than the number of dimensions in the data, so this equation 
t be exactly solved; something similar to a least-squares solution should 
sed instead. Clearly, this problem does not have a unique solution — any 

ogonal transform or rotation of A — АТ, with T an orthogonal matrix (for 
Which TIT = 1), will produce exactly the same left-hand side. We need some 
tra constraints to make the problem more unique. 
- Nov, looking fora factor-based interpretation o 
A typically tries to solve the matrix A in such a way 


transforms are More suitable for 


odel AA used is'a distribut i 

generative latent variab] ven aa 

: \ T, РСА сап ' ive 
mtv tent variable model РОА Ж“ Be derived fiom 


xX=Ay+n Бы 


f the observed variables, 
that the variables would 
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have high loadings on a small number of factor 
remaining factors. The results are then easier 
been used in such techniques as varimax, qua 
There are some important differences 
Principal component analysis is not based on 
can be derived from one. It is a linear transfo 
variance maximization or minimum mean-s 
PCA model is invertible in the (theoretical) с 
all the principal components are retained. 
have been found, the original observations 


rtimax апд, рїї} ON the 
b Oblimin TOf, Ple hag 
etween РСА, FA ations 
a generative mo del, ain, [i 
aor that is based city it 
owe or Tepresentatig T Qn 
oi no compression б n. The 
Once the Principal bs >Le., Whe 


fi Тропе 
АВЕ вау expressed "i i 
i ; n " 
linear functionsas x = 5^ у, andalso the principal com : 


1 А PONents аге și 
obtained as linear functions of the observations — Simply 


Yi= wIx 


се latent variable model; the Observation 
the factors, but the values of the factors canori 
lj 


The FA model is a 


eigenvectors of C,; several different estimati 


FA, as well as PCA 


Of principa] S Ж 
Ans. The basic procedure of PCA ‘Pat component analysis (PCA). 


is as follows — - 


the sien are ea ee ne eal vectors that provi ; 

А put data. These are unit vectors that ea A ovide a basis for 
perpendicular to the others. These vectors are ге Point in а direction 
components. The input data are a linear combi to as the principal 


components, nation of the principal 
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ü decreasing 
e” or strength. The principal components essentially serve as a 
xes for the data, providing important 
e out variance. That is, the sorted axes 
a that the first axis shows the most variance Yi 
U the data, the second axis shows the next 
Д dd variance, and so on. For example, fig. 5.13 
T the first two principal components, Y, and 
sho for the given set of data originally mapped to 
Y» ев X, and X». This information helps identify Fig. 5.7 
aps or patterns within the data. 
i i (iv) Because the components are sorted according to decreasing 
order of “significance”, the size of the data can be reduced by eliminating the 
weaker components, that is, those with low variance. Using the strongest 
principal components, it should be possible to reconstruct a good approximation 
ofthe origin al data. 


- 0.20. Explain PCA with example. 


Ans. Eigenvalues and eigenvectors are the two most important concepts 
inPCA. We think the matrix as a space, then for a symmetric positive definite 
matrix A, its eigenvectors are orthogonal to the space formed by A. The larger 
the eigenvalue is, the more information corresponding eigenvector can have 
on this basis. For example, we select the feature values of the 10 largest 


eigenvalue of an image to reconstruct the image as shown in fig. 5.8, the 
posed into 10 images (each feature vector corresponds to one 


X) 


graph is decom| 
image). 


. 5 ? Жу 


ages Corresponding 


to the First 10 Principle Eigenvector 
Fig. 5.8 The 10 In 

; If the 10 images are superimposed, we 
difference to the original one, the space compose 
ains 9596 energy of the original image. 


can get a picture with not much 
d by the 10 feature vector 


cont 
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The idea of PCA is to reduce the 
make the variance of the data distribut 
make the largest variance in the data dis 
The following steps will show how t 


to select these directions — 


(i) Moving the data cen 


that is, all the sample р 
obtain the normalized data, 


(ii) Assume that the s 


and now each sample point i 


necessary to multiply the project 
[U1, U2 


(iii) Now we get the co- 
I space, then we need to 


after dimension reduction is р у 
1 


dimensional 


ŽIT ¥', then the variance of the data di 


DSion to the gj 
um. Those dj 
the principy 
Ximum Varig 


ion maxim 
tribution are 
© make the ma 


ter to 0 co-o inate 
s i S and à 
oints X minus the mean and divided by nl 

e 


P, set the p 


ordinates of each sam 


ple point j 
compute the Variance, E 


Obviously, data 


i Ж 
Y , the value Y In step 2 can be c 


number of rows (or 
equivalently columns) we 


it features called featut 

ng. €, but it is considered 
Features extraction performs some tr; mation: i à 

generate other features that are more signi MEAM Of original features? 


~ cature extraction is general 


_ Advantages of Feature Extraction — 
of extracting the relevant features from la 


Feature extraction is the process 


t fee тре database for dimensionality 
duction. Feature extraction is a key process to reduce the dimensionality of 


medical dataset for efficient disease prediction. The feature extraction technique 
removes irrelevant features to acquire higher prediction accuracy during disease 
diagnosis. 

“In feature extraction, size of the feature space can often be decreased 
without losing a lot of information of the original feature space. 


à - 0.22. Explain the principle of SVD. 
mem B 


gius: Ans. Let us consider a second form of matrix analysis that leads to a low- 
dimensional representation of a high-dimensional matrix. This approach, called 
singular-value decomposition (SVD), allows an exact representation of any 
matrix, and also makes it easy to eliminate the less important parts of that 
representation to produce an approximate representation with any desired 
number of dimensions. 

‚___ We explore the idea that the SVD defines a small number of “ concepts” 


‘that connect the rows and columns of the matrix. 
^ LetM bean m x n matrix, and let the rank of M be r. Recall that the rank 


Of a matrix is the largest 


can choóse for which no ' 
nonzero linear combi- 


Ken] fre] гы) а 
nation of the rows is the 
all-zero vector 0 (we say 


U N 
1 
a set of such rows or 


columns is independent). Fig. 5.9 The Form ofa Singular-value Decomposition 
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Then, we can find matrices U, Z, and V Unit-V 161 


properties — as shown in fig, 5 


jdeas behind SVD 5 
high dimensional, 
eset of data points and 
a lower dimensional 
ses the substructure 

more clearly and 
n most variation to the 
es SVD practical for | 
cations is that we can 
more variation below a 0 
shold to massively 1 2 3 4 5 
but be assured that the Fig. 5.11 Regression Line Along Second 
tionships of interest have Dimension Captures Less Variation in 
ved. Original Data 


f. How does a neural network work ? Explain. 

‘A neura! network can be thought of as a black box that transforms 
rector x to the output vector y, where the transformation performed 
ї of the pattern of connections and weights, that is, according to 
of the weight matrix W. 

Sider the vector product 

4 x*w = Xx;Wi 
is a geometric interpretation for 
ct. It is equivalent to projecting one f | 


го 2) the other vector in n-dimensional Fig. 5.12 Vector Multiplication | 


(0 Uisanmxr column-orthono 
columns is a unit vector and 


(ii) Vi 


rmal matrix. 
the dot Product of IN, th 


at ig 


trix; that is © orthonor 


0 


w 


ation 
ation 


RN. is like Vector Projection : 
His notion is depicted in fig. 5.12 for 


Yo-dimensional case. 

1e magnitude of the resultant vector is given by 

| x*w = |х| |w] cos.8 

kk] denotes the norm or length of the vector x. Note that this product is 
m when both vectors point in the same direction, that is when 0 = 0. The 
is a minimum when both point in opposite directions or when 0 = 180°, 
his illustrates how the vectors in the weight matrix W influence this 


S to the nodes in a neural network. 

Possible to use these re i | 
posite е these n Sesion i 0.25. Find all possible stable states of the neural network. 
uncorrelated data points that wil] ° 
Show Subgroupings in the 
Original data 


Ло! necessarily Fig. 5.10 pes, e, 


visible at first glance. Data from Ty, 


(R.GPK,, June 2013) 
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Processing elements, or units, are alw. 
inactive. In the given 


igure, unit filled p b 
are blanked are inacti E 


500 
7 INPUT 500 1000 
_ (First and Second Formants) FI (Hz) 

› We can 1 


1400 


i es. For example, the letter “x” і 
EET 


ith a back propagation network. 
Bs fii) Connectionist Vision — 


ith limited hardware. Only the ce 


oth the engineering and biological aspects of massively parallel vision, 


ation plays an important role in connectionist vision Systems. 
anslatin te ti a s. mmt BÉ neural units receive their initial activation levels 
сп attacked With neura] Netw, Xs Inta Speech rather om a video camera and then these activations are iteratively modified based 
Cre ii recognition, and high Performan, Bits Sp eech production 1 the influences of nearly units one use for relaxation is detecting edges, If 
y edu. OAK that learn to Pronounce English tex, ата ате available P ny units think they are located on an edge border, they can override any 
| „ystems to demonstrate that Connectionist m thod xt, Was Опе of the first | many 
^ "World tasks. Sthods could Бе 


ў issenters 
s aPplied to real- puisse 
"Vf йу 
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d BUrPUF ue DECISION REGIONS 
| P h of Ten Vowels 4000 
è Ive. Units are con i B. E. 
Connections, A Positively Weighted connec | 
to activate each other. A negative connection p 
a neighbouring unit 


ranslation of text into 
Х 15 usually pronounced 
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i lications 
iefly explain the app What are the 
e 7. E, Aedia ] network? .. (FG. Dec, 2999) 
em E Applications — Refer 0.26, 


of neural networks. 


network js different from the 


needs to be emulated. 
(0) Requires high Processing time for large neural networks, 


hitecture of the microprocessor therefore 


I network in medical field E. " 
(R-GP.K,, Nov, 2018) еты 
l field are as follows — 


ed for the analysis of 
proved to be 100% with an 8% 


cult differential diagnosis 
between Alzheimer disease and vascular dementia can be assisted by neural 
i re Э 2 
Е Mat each unit represents so Prefer to be active, 
particular city in 


Network analysis of brain SPECT image data. ао 
| — Pulmonologists and radi 
to Poth iti 1 iv) Pulmonology — Pu | 
1 a short tour To find that tour we pus p a fad ome n 1А ether Lane development of a system for the classifi 
Tandom activation values, Once ај the we; hts Out by giving our unis} ор 
themselves asynchronously acc are 
until a stable s 


cation of solitary 


dules. According to their results, neural network analysis of 
_ Pulmonary no i 


: i successful than conventional classification methods. 
А Network kn cwm | 4 ы уеге е accurate than 2 well trained experts 
n Constraints are Minimal, 27 fos for the diagnosis of pulmonary embolism. u ИМЕНИ 
Other tasks Successfully tackled by neural networks, į l | (v) Radiology — To date, the application of neural ne 
| Play backgrammar to classify sonar Signals, to compress ; nc ude 
| а vehicle along a road. Mages 


f ost interesting and most powerful in the field of radiology. Images contain 
» and to driv} рет 
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much information and th 
à ey are so comp]; 
plicated that ; 
at ір, 
mne development of hardware have empowered machines with more 
y^ onal power the convolutional networks are going towards deep 
providing better results. For the object detection purpose the most 


noise filtering and for iti 
recognition of А b ; 
unusual] images. 8 €Comes SUitaby ФП 
ef К : 
0 тё used deep learning models consist of the recurrent convolutional 


nvolutional Neural Networks (CNNs) are much successful in various 


and laboratory fi 
ndings, a neura] 
of liver diseases. Th i Qs OH was Creat was 
: e s On 
kasan ct : network achieved a нен to diagnose five hi nvoluti = 
of residents and those чэш lon accuracy so Classe learning and computer vision problems. These are used in variety of 
1 ^ теу ‘recognition and robotic field. There are a number of reasons that 


onal neural networks (СММ) have been applied with great success 
age recognition of objects in the air from the past few years. From 


spendencies are. Thus, compared to standard feed-forward neural 
orks with similarly-sized layers, CNNs have much fewer connections 


Application Field 
Ч = ecture performance. 
msequently, the typical architecture of CNNs is a multilayer stack 
iple modules such as convolutional layer, pooling layer and fully- 


ted layer. Starting with the raw input, each module transforms the 


Diagnostics, p, = 
. Ics, Prognosti 
Diagnostics F 


Intensive care 


Gastroenterology Prediction 
On ABD Prediction 
olo i : E | : | 
Paedia 4 д ту 2105008 entation at one level into a higher and more abstract level. Meanwhile, 
Neurolo рН, Prognostics Cognition tasks, higher ranking of dataset representations an increase 
EEG nd ee ects of the input which are important for discrimination and limit 
Otology-Rhinolo n" Processing, Modelling { lated of variations. 
Obstetrics and e yngology Si ae The deep learning technique based on convolutional neural network has 
Ophthalmolo ynaecology Dara Processing, Modelling eved great performance improvement in large-scale image classification 
Radiology P Siena s and set off the upsurge of deep learning besides it is a new hot spot in 
Clinical chemis Signal Processing, Modelling domain of pattern recognition. It allows a model that consists of multiple 
Patholo чу Si Processing (X-ray, US CT) cessing layers to study data representation with various levels of abstraction. 
Cytol и м Processing Dia, 2 21 hermore, in deep learning techniques, besides data formation, transfer 
бепе m omms earning is useful when someone wants to train on their own dataset for various 
B eil asons, for example, the dataset may not be enough to train the full neural net 
Iochemistry hi MM problems in transfer learning. 
3 4 e ing ma ;ed to take a pre-trained deep 
Пу, transfer learning may be use 1 
4 M c replacing the fully-connected layers (and potentially the last 
[та] networ $ ose layers on the related dataset. Nevertheless, 


1 ‘onal layer) and training th i 
а that deep neural networks (DNNs) easily suffer from 
it has been In this review, the technique of machine learning 


over fitting with small samples. 
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- is important to ensure the quality and efficiency ofi 
1 verification and clustering that want to train m 
1mage recognition by using the accurate learni 


mage i 
Ore effecti 
ng machin 
Q.30. Explain in detail about the convolutio 


Ans. The convolutional layer is the core buildi 
neural network and aims to resolve the limitations 


n lerm 

S of с 
Veness eben 
es Method. 
n layer, 


(a) 3D 


ене. by u and v, r in horizontal 
4 P CKward pass, w 
function with respect to the Wi 5 calculate the partial derivatives of the loss 


eights a i 
74 biases of the respective layer as in 


VOL = 

wf >; Уч) Г, Кочар) 
У (е £ = 

b ) i Уу) р 


uv 
where, W denotes a spatially flippe 


L d filter in 
correlation rather than a convolution. оше Compute the cross- 


~ network 1 


- fully conr 


jl What do you mean by pooling layer ? Explain. 
, s a other important concept of convolutional networks is pooling, which 


f non-linear downsampling. The pooling layer partitions the input 
t of non-overlapping rectangles and, for each subregion, outputs 


other common pooling operation is average pooling, which computes the 
Ant the activations in the previous layer rather than the maximum. The 
he pooling layer is to progressively reduce the spatial size of the 


Presentation to reduce the amount of parameters and computation in the 
network, and hence to also control overfitting. It is common practice to 


odically insert a pooling layer in between successive convolutional layers. 


(b) Local 


(a) Global 
Fig. 5.18 Max-pooling Layer 
During the forward pass, the maximum of non-overlapping regions of the 
previous activations is computed as in 
| x = max[x 4-9 ).. 


u,v 


ince the pooling layer does not have any learnable parameters, the 
ward pass is merely an upsampting operation of the upstream derivatives. 
Case of the max-pooling operation, it is common practice to keep track of 
he index of the maximum activation so that the gradient can be routed towards 
5 origin during back propagation. 


| rameters of the pooling layer are its stride and filter size. Since 


— The hyperpa t 
they have to be chosen in accordance to each other, they can be interpreted as 


"the amount of downsampling to be performed. 


P Q.32. What is fully connected layer ? Discuss its limitation. 


Ans. The fully connected layer is a synonym often used in the convolutional 
iterature and is equivalent to a hidden layer in a regular artificial 
t is sometimes referred to as linear or affine layer. Intuitively, the 
nected layer is responsible for the high-level reasoning in a 
1 network and is therefore typically inserted after the 
The neurons have full connections to all activations in 


- network. 1 


convolutional neura 
convolutional layers. 


Unit-V 169 


Scanned with CamScanner 


170 Image Processing and Computer Vision 
Unit-V 171 
yperparameter in a fully connected layer isthe number of output 
Е the input connects to, i.e. how many learnable Parameters connect 
fe input to the output. 
те таш limitation of the fully connected layer is the assumption that each 
igput feature, i.e. pixel in the image, is completely independent of neighbouring 
Js and contributes equally to the predictive performance. However pixels 
are close together have the tendency to be highly correlated and is the 
structure of images has to be taken into account. Additionally, fully 
ected layers do not scale well to high dimensional data Such as images 
ach pixel of the input has to be connected to the layer's output with a 


le parameter. 


the previous layer, as seen in a regular artificia] neural nety, k 
Ork as 
show 


fig. 5.19. н. 
In 


' 
LI 


Input 
Layer 


33. What is machine learning ? 


пз, Machine learning is a branch of science that deals with programming 
fhe systems in such a way that they automatically learn and improve with 
rience. Here, learning means recognizing and understanding the input data 
id making wise decisions based on the supplied data. 
__ Itis very difficult to cater to all the decisions based on all possible inputs. 
i а ckle this problem, algorithms are developed. These algorithms build 
owledge from specific data and past experience with the principles of 
atistics, probability theory, logic, combinatorial optimization, search, 
forcement learning and control theory. 


Layer L 
(b) Two Hidden гае? The developed algorithms from the basis of various applications such as— 
Fig. 5.19 Fully С, ; үз " 4 i) Vision processin (ii) Language processing 
ihe айлы габ. Layers in an Artificial Neural Network ' fine (eg, stock market trends) 
product of the weights n а layer eam be computed. with a dot E (iv) Pattern recognition (v) Games 
offset as in previous layer activations followed by a bias EU (vi) Data mining (vii) Expert systems 
(viii) Robotics. 


x = (w)T x ^-D 14 


Q.34. Write and explain different types of machine learning. 


(é-1) ee 
but К UON ofthe previous | Ans. The different types of machine learning are shown in fig. 5.20. 


ayer and x wO). and 
Machine Learning 


7 d | ' 
b© denote ће activations, weights and biases of the current layer, respectively 


During the backward pass, the i i ч 
Р > gradient with res ; 
biases are computed as in Pect to the weights and 


УО L = (x)T уез) г) rant 


Reinforcement 


Semi-supervised 
Learning 


Learning 


Unsupervised 
Learning 


Supervised 
Learning 


2 . n . 
VOL = > (Vy (6+1) £r ui Fig. 5.20 Types of Machine нал О 
i- 1 : Supervised Learning — 10 this type of learning, the machine is 
i=] ; (i) р і | ‘ B 
where V. (r4) denotes the upstream derivatives i provided with a given set of inputs with their desired outputs. The machine 
x + 5. 
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needs to study those given sets of inputs and out 


Unit-V 173 
that maps inputs to desired outputs. 


E before providing as input to respective algorithms. Thus, it has a 
ant impact on results to be achieved or obtained. 


puts and 
finda Бепега| fun 
Clion 


(ii) Unsupervised Learning — This lype of fe 


i 4 S ч ў . aming je А " 
learned by its own' by discovering and adopting, based on the: 'S termeg as (ii) As we have one more term interpretation. That it result is also a 
In this learning the data are divided into different clusters and he Input Pattern challenge. That need to determine the effectiveness of machine learning 
T Р is called а clustering algorithm. "Ice the leary ing ims 


(iii) Semi-supervised Learning — This fearnin g is used f; (iii) We can say uses of machine algorithm is limited. Also, it's not 
or t 


i applications as supervised learning. But it uses both labeled and he same any surety that it’s algorithms will always work in every case imaginable. 

li і for training, This type of learning can be йвей with mer labeled data уе have seen that in most cases machine learning fails. Thus, it requires 
саа regression and Prediction. Semi-supervised learning ae as ia ding of the problem at hand to apply the right algorithm. 
Mines : со ad en 3 15 too high to allow for a ful bi "wi (iv) Like deep learning algorithm, machine leaming also needs a lot of 

: ‘ ess. Early examples of this j e identify; abel / it mi i 

: 7 Pn iin y р Is include identifying a Person’s fac К data. As we can say it might be cumbersome to work with а large amount 


Fortunately, there are a lot of training data for image recognition purposes. 


(у) One notable limitation of machine learning is its susceptibility to 
errors. Brynjolfsson and McAfee said that the actual problem with this inevitable 
fact, That when they do make errors, diagnosing and correcting them can be 


difficult. As because it will need going through the underlying complexities. 


with the objective to maximize th i 
А € efficiency (performan i 
learning process ensures less participati ha dne 


| on of human expertis 2: 

ess | е and save 

time. Reinforcement learning is often us à i se 
With 


ed for robotics, gaming 
which actions yield the greatest rewards. Pee elal and 2 


the algorithm discovers 
Q.35. Write the advan 
Ans. The five advanta, 


Ontinual 


0.37. Explain the applications of machine learning. 


- Ans. (i) Computer Vision — Many current vision Systems, from face 
- recognition systems, to systems that automatically classify microscope images 
of cells, are developed using machine learning, again because the resulting 
Systems are more accurate than hand-crafted programs. One massive-scale 
application of computer vision trained using machine learning is its use by the 
IS Post Office to automatically sort letters containing handwritten addresses. 


tages of machine learning. 
ges of machine learning are as follows — 


achine leaming uses data to discover 
T your problem, As you collect 


decision making engine fo the optimal 


" но à accuracy " increase automatically, mordata, the _ Over 85% of handwritten mail in the US is sorted automatically, using handwriting 
a i) Automated —Asan ; ; - analysis software trained to very high accuracy using machine learning over a 
"m learning model сап ears Swers are validated or discarded, the machine ух иш d 


Very large data set. 


ae (ii) Speech Recognition — Currently available commercial systems 

for speech recognition all use machine learning in one fashion or another to 
- frain the system to recognize speech. The reason is simple — the speech 
_ fécognition accuracy is greater if one trains the system, then if one attempts 
to program it by hand. In fact, many commercial speech recognition systems 
involve two distinct learning phases — one before the software is shipped 
(training the general system in a speaker-independent fashion), and a second 
phase after the user purchases the software (to achieve greater accuracy by 
training in a speaker-dependent fashion). 

(iii) Bio-surveillance — A variety of government efforts to detect and 
track disease outbreaks now use machine leaming. For example, the RODS 
project involves real-time collection of admissions reports to emergency rooms 
across western Pennsylvania, and the use of machine learning software to learn 


" (iii) Fast — Machi 
milliseconds as new data streams in, allowin 


Я le — Many data-driven 
. e learning model 


(V) Scalable — As your busi 
n i ; i 
Scales to handle increased data rates, Sulle ie адо easiy 


scale to handle large amounts of data on many mac, Seming algorithms can 


Q.36. Write the disadvantages of machine learning 


learning are ag foll 

. 7 H ow Ds 
(i) Machine learning has the major challenge calle а 
based on different algorithms data need to be processed 


Ans. The disadvantages of machine 


d acquisition, Also 
2j And, it must be 


symptoms and their geographical distribution. Current work involves adding in 


I 


the profile of typical admissions so that it can detect anomalous patterns of ` 
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arich set of additional data, such as retai 
i i | il purchases 
» a 3 the information flow into the system. е 
mated leaming methods given this even more comple igi 
€x data se 
t 


(iv) Robot Control — Machi i 

oe used in a number of robot аа тіпа Method 
eles ша тары ы the use of machine аник oe 
жыш le helicopter flight and helicopter enc in 
aon. о involving a robot driving n at 
саа ` esert was won by a robot that used d 
ue ty to detect distant objects (training it e 
sisting of terrain seen initially in the ве pus = 
э se 


the 
“Counter me dici 
ШЫ 


һе need for 


> Sever, 

; a 
quite contro 
108. The Tecent 


за for over 
ne learning t 
m self-collectog 
en later up Close 
at Which involves 
guage and its нне 
ol of legislation or 


t ich is in short supply. Dis 
sn helping radiologists to make intellectual 
0.38. What are machine learn Е, е 

dis tesis е ча» ning tools ? Explain. 
data into actionable information 


prediction and diagnosis j 
dic gnosis is achi 
decisions with radiology rein А 


shortening the time. 

The ae learning tools are as follows 
| i) Platforms — PI * 
project from beginning to en ame are use 

(a) Provid NER 
learning project. vide capabilities re 
(b) The interface ma . 
К у be graphical 

(С) They provide a lose coupling Ark command lits 

tuc (d) They are provided for general purpo get 
er than speed, scalability or accuracy. se use a 


d to 
complete machine learning 


quired at each Step in a machine 


nd exploration 
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— Library gives capabilities for completing part of a 


a specific capability for one or more steps in a 


| (а) Provide 
earning project. ar 
_ (b) The interface is typically ап application programming 
requiring programming. 
a specific use case, problem type or 


They are tailored for 


(c) 


i) Graphical User Interfaces — 
(a) Allows less-technical users to work through machine 


(b) Focus on process and how to get the most from machine 


(с) Stronger focus on graphical presentations of 


| visualization. 
(d) Structured process 


(iv) Command Line Interface — 
(a) Allows technical users who are not programmers to work 


achine learning projects. 
(b) Frames machine learning tasks i 


tput to be generated. 
(c) Promotes reproducible results by ri 

nds and command line arguments. 
(v) Application Programming Interfaces — 

(а). To incorporate machine learning into our own software projects. 

н (b) To create our own machine learning tools. 

(c) Gives the flexibility to use our own processes and 
tomations on machine learning projects. 
: (d) Allows combining our own methods 
e library as well as extending provided methods. 
(vi) Local Tools — Local tools can be downloaded, installed and run 


local environment. 
(a) Custo 


(b) Contro 
(c) Integrat 

(vii) Remote Tools — Remote to 
called from local environment. These tools are о 
learning as à service (MLaaS). 
(a) Tailored for 
(b) Run across m! 


imposed on the user by the interface. 


n terms ofthe input required 


ecording or scripting 


a 


with those provided 


data and algorithms. 
on and parameterization. 


e into our own systems to meet our needs. 


ols can be hosted on a server and 
ften referred to as machine 


mized for in-memory 
1 over run configurati 


scale to be run on larger datasets. 
ultiple systems, multiple cores and shared memory. 
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Q.39. Write and explain scope of machine learning, 


Ans. The scope of machine learning are as follows — 


(i) Explaining Human Learning — А mentioned earlier, 
learning theories have been preceived fitting to comprehend features 
in humans and animals. Reinforcement learning algorithms es 

| dopaminergic neurones induced activites in animals during re 
| learning with surprising accuracy. ML algorithms for uncovering 
sporadicdelineations of naturally appearing images predict visual features detected 
in animals initial visual cortex. Nevertheless, the important drivers in human or 
animal learning like stimulation, horror, urgency, hunger, instinctive actions 


| and learning by trial and error over numerous time scales, are Not yet taken 


Machine 
Of learn; ng 
timate the 
Ward-baseq 


Dew programming language that is self-sufficient to Support manuali 
subroutines as well as those defined as “to be learned", Programming languages 
like Python (Sckit-learn), R etc. already making use of this Concept in smaller 
scope. But a fascinating new question is raised as to develop a model to define 
relevant learning experience for each subroutines tagged as “to be learned", 


any unforeseen modification to the program's 
| function. 


| (iii) Perception — A generalised concept of computer perception that 
i can link ML algorithms which are used in numerous form of computer 
perception today including but not limited to highly advanced vision, speech 
recognition etc., is another potential research area. One thought-provoking 
problem is the integration of different senses (e.g., sight, hear, touch etc.) to 
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